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A  new  nonparametric  sign  statistic  based  on  interdirections  is  proposed  for  testing 
whether  two  sets  of  variates  are  independent.  This  interdirection  quadrant  statistic 
reduces  to  the  sample  coefficient  of  medial  correlation  (or  quadrant  statistic)  when 
the  two  sets  of  variates  have  one  variable  each.  It  has  an  intuitive  invariance  prop- 
erty for  the  present  problem  and  has  a  limiting  chi-square  distribution  under  the  null 
hypothesis  of  independence  when  each  set  of  variates  is  elliptically  symmetric,  mak- 
ing it  asymptotically  distribution-free.  The  new  statistic  is  compared  to  the  classical 
normal  theory  competitor,  Wilks'  likelihood  ratio  criterion,  and  a  component-wise 
quadrant  statistic.  Using  a  novel  model  of  dependence  between  the  sets  of  variates 
enables  the  computation  of  Pitman  asymptotic  relative  efficiencies  (ARE's).  The 
Pitman  ARE's  indicate  that  the  interdirection  quadrant  statistic  compares  favorably 
to  Wilks'  likelihood  ratio  criterion  when  the  sets  of  variates  have  heavy-tailed  dis- 
tributions and  is  uniformly  better  than  the  component-wise  quadrant  statistic  when 
the  sets  of  variates  are  spherically  symmetric.  A  simulation  study  demonstrates  the 
relative  performances  of  the  three  competitors  as  well  as  some  other  statistics  often 
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found  in  commercial  software  packages.  The  results  indicate  that  the  interdirection 
quadrant  statistic  performs  better  than  the  others  for  heavy-tailed  distributions  and 
is  competitive  for  distributions  with  moderate  tail  weights.  Finally,  several  appli- 
cations of  the  interdirection  quadrant  statistic  are  illustrated,  with  comparisons  to 
its  competitors.  In  one  example,  the  interdirection  quadrant  statistics  resistance  to 
outliers  is  demonstrated. 
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CHAPTER    1 
INTRODUCTION 


1.1      Bivariate  Tests 


The  question  of  whether  the  pair  of  random  variables  (X,  Y)  are  stochastically 
independent,  based  on  the  random  sample  {(A',,  Yt),  i  =  1, . . .  ,n}  from  a  continuous 
distribution  with  density  function  h(x,y),  has  generated  vast  amounts  of  research 
over  the  past  century.  After  the  "discovery"  of  the  correlation  concept  by  Calton 
(1888),  many  bivariate  measures  of  correlation  were  invented  to  explore  the  nature  of 
the  dependency  between  X  and  Y .  Examples  include  the  classical  Pearson  product 
moment  correlation  coefficient  (Pearson,  1896) 

jjXi  -  X)(Yi  -  Y) 

, t=i 

7'   ~~       /  v    1  12  ' 

/    n  n  \  1 1  L 


Y&i-z?Y]yi-n 


and  numerous  rank  correlation  statistics  based  on  the  ranks  of  the  Xi's  (Y^s)  denoted 

by  R\, . . . ,  Rn  (Q\, . . . ,  Qn)  such  as  Spearman's  rho  (Spearman,  1904) 

12      A/,,       n  +  l\/„       n  +  1 
i=i 


-^t^-^)^-^) 


Kendall's  tau  (Greiner,  1909;  Kendall,  1938) 

1 

*  =  —, rr  H ssn  (R<  ~  Ri)ssa  (Qi  -  Qj) . 

n(n-  l)  tZj 

and  the  sample  coefficient  of  medial  correlation,  or  more  simply  the  quadrant  statistic 
(owing  to  the  fact  that  it  is  based  on  the  number  of  points  in  the  four  quadrants 


defined  by  the  marginal  medians)  (Blomqvist,  1950) 

1=1 


'/  =  -  J2sku  [R*  — —)  ssn  [Qi 


n,=i 


where  X  ( Y)  is  the  median  of  the  X;'s  (Vi's).  These  measures  can  be  used  to  conduct 

hypothesis  tests  for  independence  of  X  and   Y.     Many  results  are  already  known 

about  the  comparison  of  such  (suitably  normalized)  bivariate  tests  of  independence, 

and  for  various  models  of  dependence,  asymptotic  relative  efficiencies  (ARE's)  have 

been  computed.  We  present  a  brief  summary  of  these  results. 

Farlie  (1960)  introduces  the  bivariate  distribution  function 

H(x,y)  =  F(x)G{y){\  +  aA(x)B(y)},  (1.1) 

where  F(x)  and  G(y)  are  the  known  marginal  distribution  functions  of  X  and  Y. 
For  (1.1)  to  be  a  bonafide  cdf,  d(F(x)A(x))/F(x),  d(G(y)B(y))/G(y),  A(x),and  B(y) 
need  to  be  bounded,  with  y4(oo)  =  B(oc)  =  0.  Clearly  a  is  a  measure  of  depen- 
dence, with  a  =  0  denoting  the  independence  of  X  and  Y .  The  legal  range  of  a  is 
determined  by  the  greatest  and  least  values  of  the  product  {d(F(x)A(x))/ F(x)}  x 
{d(G(y)B(y))/G(y)}  over  all  variation  of  x  and  y.  Farlie  (1961)  then  derives  the 
asymptotic  efficiency  for  a  generalized  correlation  coefficient  V  devised  by  Daniels 
(1944).  To  each  ordered  pair  of  X's,  (X{,  Xj),  assign  a  score  arj  such  that  a^  =  —  a]X 
and  an  =  0.  In  a  similar  way  assign  scores  &,_,  using  the  K's.  The  definition  of  T  is 
then 


r  = 


!  =  1    J  =  l 


I       71  71  11  71  \ 


Gamma  includes  many  well-known  correlation  coefficients  as  special  cases,  among 


w 


hich  are  p,  r,  and  q' .  To  get  r  for  example,  put  a,{j  =  sgn  (/?,  —  /?_,). 


Farlie  shows  that  for  alternatives  with  the  distributional  form  (1.1),  there  is  always 
some  coefficient  of  Daniels'  family  of  coefficients  that  is  fully  efficient  (i.e.,  the  Pit- 
man ARE  with  the  maximum  likelihood  estimator  is  unity).  We  emphasize  that  the 
ARE's  axe  determined  by  letting  a  — *■  0  as  n  — ►  oo  and  hence  are  Pitman-type  ARE's. 
Here  a  is  the  parameter  in  (1.1)  and  not  the  level  of  the  tests.  Table  1.1  summa- 
rizes his  results  when  H(x,y)  is  the  bivariate  normal  distribution.  Note  that  in  this 
case  the  maximum  likelihood  estimator  is  r,  and  a  is  the  "true"  correlation  between 
X  and  Y .  Farlie  further  notes  that  his  results  agree  with  those  obtained  by  Blomqvist 
(1950)  for  q'  and  by  Stuart  (1954)  for  p  and  r. 

Table  1.1.    Pitman  ARE's  Reported 
by  Farlie  Under  Bivariate  Normality 

ARE(/o,r)  9/tt2 

ARE(r,r)  9/tt2 

ARE(?',r)  4/tt2 

Konijn  (1954)  considers  the  model  of  dependence  X  =  \\U  +  XiV  and  Y  = 
X3U  +  X4V,  with  U  and  V  independent  and  A1:  A2,  A3,  and  A4  constants.  The 
independence  case  is  produced  by  setting  A2  =  A3  =  0.  When  U  and  V  have  the 
same  marginal  distribution  G,  he  computes  Pitman  ARE's  for  several  choices  of  G 
by  letting  A2  — ►  0  and  A3  — >  0  as  n  — >  00.  His  results  are  summarized  in  Table  1.2. 

Table  1.2.  Pitman  ARE's  Computed  by  Konijn 

Normal     Uniform     Parabolic      Laplace 


ARE(p,r) 

9/tt2 

1 

0.8569 

1 .2656 

ARE(r,r) 

9/Y2 

1 

0.8569 

1.2656 

ARE(9',r) 

4/tt2 

1/4 

0.3164 

1 

Hajek  and  Sidak  (1967)  use  the  model  X  =  X*  +  AZ  and  Y  =  Y*  +  AZ 
where  X*,  )*,  and  Z  are  mutually  independent  with  densities  /,  g,  and  m,  re- 
spectively, and  A  is  a  positive  constant.  They  require  /  and  g  be  known  and  that 
0  <  Var[Z]  <  oo.  They  focus  on  determining  locally  most  powerful  rank  tests.  When 
/  and  g  are  of  logistic  type,  the  locally  most  powerful  rank  test  is  based  on  p,  and 
they  indicate  that  q'  is  the  resulting  statistic  when  using  approximate  scores  in  the 
locally  most  powerful  rank  test  for  /  and  g  of  double  exponential  type.  They  ac- 
knowledge that  they  were  unable  to  derive  Pitman  ARE's  as  A  — >  0,  but  Puri  and 
Sen  (1971)  successfully  complete  the  work  using  a  model  that  encompasses  Hajek  and 
Sidak's  as  a  special  case.  The  general  multivariate  version  of  this  model  is  discussed 
in  Section  3.3.  In  the  bivariate  case,  denote  the  joint  density  function  of  A'  and  Y 
by  h&(x,y)  =  J  f(x  —  Az)g(y  —  Az)m(z)dz.  Let  X,  Y,  and  Z  each  have  unit  vari- 
ances and  define  a  sequence  of  alternatives  by  A„  =  n_1/,4A0.  They  show  that  the 
Pitman  ARE  of  the  statistic  Tn  =  n-1  Yla=i  J(Fn(Xa)) J{Gn(Ya))  compared  to  r, 
where  J  is  a  standardized  score  function  and  Fn  and  Gn  are  the  empirical  cdf's  of 
A'  and  V,  respectively,  is  given  by 


ARE(rn,r)  =  lim 


A,;1  jj  J(F(x))J(G(y))dH*n(x,y)}2 . 


As  an  example,  using  the  score  function  Jo  defined  as 

1         if    1/2  <  u  <  1, 
Jo(u)  =  {0        if  u=  1/2, 

-1      if   0  <  u  <  1/2 


(1.2) 


and  assuming  H&n  is  bivariate  normal,  we  have  that 

ARE((/',r)  =   lim  h/w  ■  A"1  sin-1(A,l/(l  +  An))]2  =  4/tt'2. 


In  a  series  of  exercises,  Puri  and  Sen  also  investigate  a  slight  variation  of  the  Hajek 
and  Sidak  model  in  the  bivariate  case.  They  use  X  =  (1  —  A)A'*  +  AZ  and  Y  = 
(1  —  A)V*  +  AZ  with  X*,  V*,  and  Z  mutually  independent.    Their  modifications, 


in  conjunction  with  the  model  used  by  Konijn,  were  admittedly  the  inspiration  for 
Model  1  considered  in  Section  3.2. 

1.2     Multivariate  Tests 


Our  interest,  however,  lies  in  the  multivariate  extension  of  this  problem.  Specifi- 
cally, instead  of  testing  whether  two  real- valued  random  variables  are  independent,  we 
consider  testing  whether  two  vector-valued  random  variables  are  independent,  where 

the  dimensions  of  the  vectors  need  not  be  the  same.  Let  {Xt  =  (X]  ,X\  )',  i  = 
l,...,n}  be  a  random  sample  of  n  pairs  of  vectors  from  a  continuous  distribution 

with  density  function  fx  (#'  ,  #v  ),  where  X\  is  r^  x  1  and  has  marginal  den- 
sity fk  fx'fc)j,  k  =  1,2  (further  occurrences  of  the  index  k  generally  mean  the  state- 
ment holds  for  both  k  =  1  and  k  =  2).  Further,  assume  that  f/.  fa;**'"']  represents  a 
distribution  that  is  elliptically  symmetric  and  centered  at  the  r/t  x  1  vector  6^  (i.e., 
/*  (x^J  is  a  function  of  ell  =  (x^  —  flfcj'EjjT^a;^  —  9k)  alone,  where  E*  is  a  positive- 
definite  matrix.)  This  is  a  common  assumption  in  multivariate  theory  as  it  implies 
the  simple  structure  that  observations  on  ellipsoids  defined  by  ell  =  constant  are 
equally  likely.  The  elliptically  symmetric  class  is  sufficiently  general  to  accommodate 
a  wide  variety  of  different  distributions,  so  that  this  assumption  is  not  overly  restric- 
tive.   Without  loss  of  generality,  we  also  take  ri  <  r2.    We  are  interested  in  testing 

H0:  fx  (z(1),:e(2))  =  fx  (x(1))  f2  (x{2))  versus  Hx  :  X{>]  and  if  are  correlated. 
1.2.1      Likelihood  Ratio  Criterion 

Wilks  (1935)  derived  the  likelihood  ratio  criterion  for  testing  H0:  £12  =  0  when 
the  X,'s  are  multivariate  normal  with  mean  vector  ft  =  (fi[,fi2)'  and  covariance 

matrix  X  =   (  v,"  v12  ).    The  normal  distribution  is  unique,  in  that  independence 


yn/2 


is  completely  determined  by  the  form  of  the  covariance  matrix.  Thus  Xr2  —  0  is 
equivalent  to  the  null  hypothesis  of  independence.  If  A  =  Yl'a=\{Xa  ~  X)(Xa  —  X)', 
and  we  partition  it  into  AtJ  =  TZ^X®  -  X^)(X^  -  X(j))\  i,j  =  1,2,  then  the 
criterion  is  expressed  as 

\Au\\A2-i\ 

—  \T     -  A~x  A     A'1  A'   I"/'2 

—  17"     —A'    4""1  A     4-1ln/2 

—  |-»r2  -^  12-^*11  -^-1 2-^*22  I 

The  criterion  also  has  a  convenient  limiting  distribution  under  the  null  hypothesis,  in 

that  -2  log  Vn'2  =  -nlog  V  -i  \2nT2. 

Muirhead  (1982)  shows  that  under  the  group  of  transformations  described  by 

Q  =  {g(B,c)  \  g(B,c)(X)  =  BX  +  c} 

where  B  =  B\  (f)  B2^  {B^  nonsingular  r/,:  x  rjt)  and  c  £  JRri+r:,  a  maximal  invari- 
ant is  the  set  of  sample  canonical  correlations  (pi,. . .  ,pTl)  where  p\  >  ■  •  •  >  p2 
are  the  eigenvalues  of  S  =  A^  A^^A^  A'l2.  The  usefulness  of  the  canonical  corre- 
lations arises  from  the  fact  that  through  linear  transformations  on  X'1'  and  X^2', 
the  correlation  structure  implicit  in  the  covariance  matrix  X  can  be  reduced  to  a 
form  involving  only  these  parameters.  In  fact,  the  null  hypothesis  can  be  restated 
as  Ho :  pi  =  •  •  •  =  pri  =0.  Note  that  the  nonzero  eigenvalues  of  A\2A^  AViA22  are 
identical  to  the  eigenvalues  of  S,  so  that  the  criterion  is  (as  one  would  expect)  invari- 
ant to  the  labeling  of  the  two  partitions  of  X.  Since  we  can  rewrite  V  as  n[=i(l  — Pi)- 
this  shows  that  V  is  invariant  under  the  group  Q.   When  r\  is  one,  the  lone  sample 

canonical  correlation  is  just  the  multiple  correlation  coefficient  between  the  X\    s  and 

the  X \    s.  Then  if  r2  is  also  one,  V  is  just  (1  —  t-2),  the  sample  coefficient  of  alienation. 
Thus,  the  likelihood  ratio  criterion  is  the  sample  vector  coefficient  of  alienation  and 


a  multivariate  extension  of  the  bivariate  test  of  independence  based  on  r.  The  in- 
variance  of  V  under  the  group  Q  is  an  important  property  since  it  implies  that  a  test 
using  V  will  not  depend  on  the  underlying  covariance  structure  of  either  the  X)  s 
or  Xt  's.  Other  statistics  that  are  also  functions  of  the  eigenvalues  of  S  (squares  of 
the  sample  canonical  correlations)  and  hence  invariant  under  Q ,  will  be  described  in 
Chapter  4. 

1.2.2     Component-wise  Quadrant  Statistic. 

A  nonparametric  approach  to  the  problem  is  explored  in  Puri  and  Sen  (1971), 
where  a  class  of  association  parameters  (and  their  sample  counterparts)  based  on 
component-wise  ranking  is  defined.  The  statistic  they  propose  is  computed  using  the 

elements  of  the  matrix  T  =  (  £,n  £,12  ).  The  elements  of  T!?  =  {T^l)}riXr    are  given 

\  I  21    i  22  '  l*J  J 

by 


S,S]~ntAn  +  l)     \n+l) 


Here,  R[kJa  is  the  rank  of  X^a  among  X;  [, . . .  ,X^\  and  J  represents  an  arbitrary 
(standardized)  score  function.  Puri  and  Sen  base  their  test  of  independence  on  the 
statistic 

sJ. 


11 1 1-4  22 1 

which  is  clearly  analogous  to  V  except  the  matrix  T  is  used  instead  of  the  matrix  A. 

They  also  show  that  under  the  null  hypothesis,   —n  log  SJ  — >  xl  T2->  so  that  their 
procedure  is  a  natural  competitor  of  —  n\og  V . 

A  score  function  that  will  be  of  interest  to  us  is  Jo,  defined  in  (1.2).    With  this 
score  function,  the  elements  of  Tr2  are 
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so  that  S  °  is  a  multivariate  extension  of  q' .  A  problem  with  statistics  based  on 
the  component-wise  ranking  scheme  is  that  they  do  not  have  the  desirable  invariance 
property  exhibited  by  V.  Thus  S  °,  using  component-wise  ranking,  fails  to  be  invari- 
ant under  the  group  Q,  and  its  performance  will  be  influenced  by  the  (presumably) 

unknown  underlying  covariance  structure  of  the  Xt  ''s  and  X \     s. 

In  Chapter  2,  we  propose  a  nonparametric  competitor  to  V  that  like  SJ°  is  a 
multivariate  extension  of  q',  but  maintains  the  important  invariance  discussed  in  Sec- 
tion 1.2.  We  derive  the  limiting  null  distribution  in  the  case  when  #i  and  9-2  are 
known.  Sufficient  conditions  for  asserting  that  the  limiting  distribution  is  unchanged 
when  Q\  and  02  are  replaced  by  the  estimates  6\  and  #2  are  given  as  well.  In  Chap- 
ter 3,  we  introduce  a  new  model  of  dependence,  with  computations  leading  to  the 
limiting  distributions  of  the  competing  statistics  under  a  contiguous  sequence  of  al- 
ternatives based  on  this  model.  Pitman  ARE's  are  calculated  using  these  limiting 
results.  In  Chapter  4  we  present  a  Monte  Carlo  study  that  corroborates  empirically 
the  results  of  Chapter  3,  and  in  Chapter  5,  we  apply  the  new  test  and  its  competi- 
tors to  some  real-world  data.  We  conclude  with  some  general  comments  and  indicate 
potential  areas  of  future  research  in  Chapter  6. 


CHAPTER    2 
INTERDIRECTION  QUADRANT  STATISTIC 


2.1      Definition 

Let  9\  based  on  X\  , ...,  X^}  and  02  based  on  X\  ,...,X\P  be  equivariant 
(under  the  group  Q)  estimators  of  6\  and  62,  respectively,  such  that  both  (#i  —  6^) 
and  (62  —  02)  are  Op(n-1'2).  The  interdirection  quadrant  statistic  is  defined  as 

n  ,=-.,=1 

where  ptC-X'i  t-^j  5  7a.)  is  the  interdirection  proportion,  first  defined  by  Randies 
(1989),  between  {X\  —  7fc)  and  (.Xj-  '  —  7fe).  We  now  describe  how  to  calculate  this 
interdirection  proportion.  If  we  let  Z\  =  X\  —  7fc,  then  the  number  of  hyperplanes 
defined  by  the  origin  and  rk  -  1  other  Z(k)s  (not  zf]  or  Zf])  such  that  Z\k)  and  Zf] 
are  on  opposite  sides,  is  called  the  interdirection  count.  To  get  the  interdirection 
proportion,  divide  by  the  total  number  of  hyperplanes  considered.  Note  that  this 
is  the  simpler  and  more  natural  divisor  given  by  Randies.  He  makes  a  small  sample 
adjustment  in  this  divisor  so  that  his  interdirection  sign  test  is  equivalent  to  Blumen's 
test  in  the  bivariate  setting.  The  interdirection  count  measures  the  angular  distance 
between  Z\  and  Z}  relative  to  the  origin  and  the  position  of  the  other  Z^'s. 
Randies  showed  that  under  a  wide  class  of  population  models  (distributions  with 

elliptical  directions,  a  superset  of  elliptically  symmetric  distributions),  when  the  X'  's 
are  centered  on  the  symmetry  point  of  the  distribution,  #*..,  the  interdirection  count 
(i)  is  invariant  under  nonsingular  linear  transformations,  (ii)  uses  only  the  direction  of 
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each  (X'  '  —  0k)  from  the  origin,  and  (iii)  has  a  distribution-free  property.  Properties 
(i)  and  (ii)  continue  to  hold  when  centering  about  0k,  because  of  its  equivariance 
under  Q  but   property  (iii)  fails  to  hold.    Then,  since  the  interdirection  proportions 

are  invariant  under  the  group  Q,  Qn{Q\,Qi)  is  as  well.  Notice  that  if  r\  =  r2  =  1  (the 
bivariate  case),  then 

cos(7r;3,(Af),  Af '»;  9k))  =  sgn  (X?]  -  §k)  sgn  (xf  -  §k)  , 


so  that 


Qn(9l,92) 


n 


Esgn(Af»-^)sgn(Af'-^) 


u=i 


which  implies  that  Qn(X^\X^)  =  [y/nq')2.  Thus,  as  the  name  indicates,  Qn(9\,9-t) 
is  an  extension  of  the  bivariate  quadrant  statistic.  In  the  next  section,  we  derive  the 
limiting  null  distribution  for  the  interdirection  quadrant  statistic  in  the  case  where 
6]  and  62  are  assumed  known. 

2.2     Null  Distribution  When  (6i,0->)  Is  Known 

In  establishing  the  limiting  null  distribution  of  Q„(#i,#2)  we  first  seek  the  lim- 
iting null  distribution  of  a  simpler  approximating  quantity.  The  simplicity  derives 
from  the  fact  that  the  interdirection  function  pk(-,-;6k)  is  replaced  by  its  expected 
value  ptc{-,  -',0k)  under  the  null  hypothesis.  All  calculations  in  this  section  are  assumed 
to  be  done  under  the  null  hypothesis,  so  no  further  explicit  mention  of  this  fact  will 
be  made.  Because  Q(0\,  92)  is  invariant  under  the  group  £/,  without  loss  of  generality 
we  assume  that  fk  (x^j  represents  a  distribution  that  is  spherically  symmetric  and 
centered  at  the  origin  (i.e.,  0k  =  0  and  S*  =  ITk-)  Define 


».  ,.     h     *i 
r\r2 


Qn  = ^^cos(7rp1(i,j'))cos(7rp2(i,i)), 


7? 

t  =  l  ]  =  \ 
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where  we  have  kept  just  the  subscripts  i  and  j,  since  the  subscript  k  on  Pk{i,j)  (and 
also  Pk(hj))  indicates  whether  it  is  a  function  of  the  X'J''s  or  X'2*'s,  and 


a  hyperplane  defined  by  r/t  —  1 
X*  ''s  and  the  origin  lies  between 


X{,k)  and  X 


(k) 


(«■■) 


XW  X(fc) 


■w> 


=  (Angle  between  X\K)  and  X}  ;)A- 

The  last  equality  follows  because  of  the  underlying  spherical  symmetry.  Let  X^K'  = 
RWZJW,  where  ftW  =  \\X{k)\\  and  L/w  =  Xw /\\X{k)\\.  Spherical  symmetry  guar- 
antees that  R\  , . . . ,  R%>  are  positive  quantities  independent  of  U\  , . . . ,  C7}v ',  and 
the  £/,  s  are  iid  uniform  on  the  unit  hypersphere  of  dimension  r^.  Here  £/'  '  is  the 
direction  of  X'  ',  and  it  is  easy  to  see  that  both  Pk{i,j)  and  Pk(i,j)  depend  only  on 


r(*) 


r(*) 


the  directions  U)     and  U\  '.  Now 


so  that 


cos(wpk(iiJ))  —  cos(iingle  between  U\  '  and  Uj)  =  U\  '  Uj, 


Qn=r-^£±UV'U?W'U?] 


i=l  j=l 


EE 

s=l  (=1 


E  y/wUSJU®  ' 


(2.1) 


La=l 


Some  moments  involving  C/'1'  and  C/'2'  will  be  useful  in  subsequent  derivations.    If 
C/(,)  and  U{2)  are  independent  with  t/(1)  ~  Uniform(flr, )  and  U{2)  ~  Uniform(Ora), 
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then  it,  is  easy  to  show  that 

1 


eHo  [uwy]  =  ±.irt 

EHo  \uW'muW]  =  -vec(M)'vec(Ir4 


(2.2) 


1 


EWo  \U{1)'M,U^U^'M2U^]  =  —vec(M1)'vec(Af2). 

J       rir2 

Two  basic  classical  limit  results  are  restated  here  for  future  reference. 

Lemma  2.2.1   a'Zn  ~  AN(a'n,a"£a)  for  any  a  ^  0  if  and  only  if  Zn  ~  ,4jV(/i,£). 

Proof  of  Lemma  2.2.1    (See  Serfling,  1980,  p.  18).  □ 

Lemma  2.2.2  If  Zn  ~  AN{fl,Ik),  then  Z^Zn  -4  xft/i'/i). 

Proof  of  Lemma  2.2.2  (See  Serfling,  1980,  p.   128).  □ 

We  now  have  all  the  necessary  tools  to  derive  the  asymptotic  distribution  of  the 
interdirection  quadrant  statistic.  We  begin  by  finding  the  limiting  distribution  of  the 
approximating  quantity  Qn. 

Theorem  2.2.1    Qn  -»  xllT2- 

Proof  of  Theorem  2.2. 1  Let  B  =  {65«}r,Xr.2  be  an  arbitrary  matrix  of  constants  that 
are  not  all  zero  and  define  Z  =  {££=1  y/F^U^U^n^.  Then 

vec(B)\ec(Z)  =  J2Y,bstZst 


5=1    (  =  1 


a=l 


n      r2 


EE^^ii1^ 


Ls=l  (=1 


=  Eu{ai)\v^B)u^\ 


«=i 
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which,  using  (2.2),  is  seen  to  be  a  sum  of  iid  random  variables  with  mean  zero  and  vari- 
ance vec(B)'ver(B).  Now  n_1/2vec(J3)'  vec(Z)  ~  AN(0,  vec(B)'  vec(JB))  via  the 
central  limit  theorem.  Lemmas  2.2.1  and  2.2.2  then  imply  that  n-1vec  (Z)'  vec  (Z)  — » 
\2riT2.  The  result  follows  by  noting  that  rr1vec(Z)'  vec(Z)  =  n"1  £^1,  ££i  Zs2(, 
which,  referring  to  (2.1),  is  just  Qn.  D 

Now  using  arguments  similar  to  those  in  Randies  (1989),  we  are  able  to  find  the 
limiting  distribution  of  Qn. 


Theorem  2.2.2  Qn  — >  x2 


T\r2 


Proof  of  Theorem  2.2.2 

\2 


E//o 


rir2 


n 


^cos(7rp1(i,j))cos(7r;32(i,j))  -  cos{Trpl{iJ))cos(irp2(i,j)) 


r2r2        n 


«.J.«  >J 


V"    H    E"o  {cos(7rpi(i,j))cos(7rp2(i,i))-cos(7rp1(i,j))cos(7rp2(i,j))} 
{cos(Trpi(i',j'))cos(irp2(i',j'))  -  cos(irpi(i',  j'))cos(irp2(i',j'))} 
{cos(irp\(i,j))cos(Trp2(i,j))  -  cos{irpi(i,j))  cos(tt  p2(i,j))} 


2  „.2 

1 LW0 


2r2r*n(n-l) 

Hi 


where  the  last  equality  is  seen  by  considering  the  following.  Let  U\  =  DlAl,  where 
D{  =  sgn  ({/,-,  )  and  A{  =  sgn  ((/,*,  )  £/'  .  Then  A,-  shows  the  observed  axis  and  D, 
indicates  which  end  of  the  axis  was  observed.  Note  that  D,  is  independent  of  A{  and 
D], . . . ,  Dn  are  iid  Bernoulli  random  variables  with  probability  of  success  1/2.  Taking 
the  expectation  first  with  respect  to  the  D's,  when  one  or  more  of  the  four  subscripts 
is  unique,  the  expected  value  is  zero.  Further,  if  i  =  j  or  i'  =  j'  the  integrand  is 
zero  since  ;3fc(i,?')  =  pk(i,i)  =  0.   Now  the  last  expectation  converges  to  zero  because 
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Randies  has  shown  that  pk(i,j)  =  Pk(hj)  +  op(l),  and  the  integrand  is  bounded  so  the 

Lebesgue  Dominated  Convergence  Theorem  can  be  applied.  Thus,  Qn  =  Qn  +  op(l), 
and  Theorem  2.2.1  yields  the  desired  result.  □ 

We  have  now  established  that  Qn  has  a  convenient  asymptotic  null  distribution, 
which  makes  it  a  viable  option  for  the  present  hypothesis  testing  scenario.  Further, 
since  it  has  the  identical  limiting  null  distribution  as  —  n  log  V  and  —  n\ogSJ°,  the 
relative  performance  of  the  three  competitors  can  be  fairly  measured.  However,  in 
practice  the  values  of  6\  and  62  are  rarely  known  so  that  they  must  be  estimated. 
The  next  section  considers  this  situation. 

2.3     Null  Distribution  When  (6i,6>)  Is  Unknown 

Unfortunately,  when  (6^,6  2)  is  replaced  by  (61,62),  the  proof  of  convergence  to 
the  \ir,r2  distribution  is  much  more  difficult.  We  wish  to  find  sufficient  conditions 
under  which  Qn(6x,62)  =  Qn(6\,62)  +  oP(l).  Since 

Qn(01,02)-Qn(01,02) 


nr2 


n 


'V2 


n 


71 

^{cos(7rp1(z,j;01))cos(7rp2(z,j;02))  -  cos(irpi(i,j]0i))  cos(TTp2(i,j]02))} 

i,j 

n 

^2  [cos^p^i,  j;  6^)  cos(Trp2(i,  j;  62))  -  cos(xp!{i,  j;di))cos(Tcp2(i,j;02))} 


+  -L^y"{cos(7rp1(?:,;;01))cos(7rp2(«,;;^2))  -  cos(npi(i,  j;  6^))  cos(irp2(i,  j;  62))\ 
=  B-[n  +  B2n,  (2.3) 


it  suffices  to  show  that  both  BXn  -^  0  and  J?2»i  -^  0.  Consider  B\n  first.  The  strategy 
for  dealing  with  B\n  is  to  show  that  it  suffices  for  the  second  conditional  moment 
of  B\n  to  converge  in  probability  to  zero.   The  conditional  moment  is  useful  in  that 
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it  "separates"  B\n  into  a  sum  of  terms  whose  factors  involve  only  the  X^'^s  or  only 
the  X'2''s.  The  limiting  behavior  of  these  factors  is  then  established  in  Appendix  A. 

More  formally,  let  .V,  =  {X\l\  i  =  L,...,n},  X2  =  {X(-2\  i  =  l,...,n},  and  11  = 
{/?,,  i=  1, .. .,??.},  where  1Z  is  a  permutation  of  the  integers  {l,...,n}.  Conditional 
on  A'i  and  A2,  the  only  random  component  of  B\n  is  the  way  in  which  the  X^'s 
and  .X^2''s  are  matched.  In  other  words,  given  X\  and  A2,  under  the  null  hypothesis 
B\n  is  a  function  of  {(XR  ,Xi  ),  i  =  l,...,n},  where  TZ  is  uniformly  distributed 
over  all  permutations  of  {1,. . .  ,  n}.  Given  this  setup,  we  prove  the  following  lemma. 

Lemma  2.3.1   If  EHo  [£2„|  X1,X2]  -^  0,  then  Bln  A  0. 

Proof  of  Lemma  2.3.1   Since 

Bin  =  J-^-y"cos(7rp1(i,;;01)){cos(7rp2(«,j;^2))  -  cos(Tp2(i,j\02))} , 


G„(A'1,A'2)  =  EHo  \Bin\  A'l5A2 


JHo 


rjT'2 


n 


J2c(R-;,R:)d(i,j) 


«,j 


where 


and 


c(z,i)  =cos(irpi{i,j\9\)) 


d(i,j)  =  cos(ir p2(i  J -O-i))  -  cos(tt  p2{i,j;02)). 


For  e  >  0  and  6  >  0, 

P[\Bln\>e]  =  P 


H 


B^nl  >  e,  C7»(Ari,Ai)  >  — 


+  P 


<P 


2  c 

Gn(XuX2)>  €— 


+  EWo 


2  c 

\B\n\  >  e,  Gn(Xi,X2)  <  —— 


e28' 


I    |5i„|>e,  G'n(A1,A2)<  — 


which  for  71  sufficiently  large, 


<  -  +  E„ 

—    ■)  Ho 


JH0 


/     l^iul  >e,  Gn(XuX2)<  - 


A',,  A 


1 1  '  L2 


=  |  +  eWo  /  ]Gn{xx,x2)  <e-f)r  [\Bm\  >  t  l  xu x2] 


6 
2   ' 


<  x  +  EHi 


I\Gn{Xl,X2)< 


,26\    1 


2    /  e2    "° 


E^j^jA^] 


2         Wo 


<  -  +  E„ 

—   9  "o 


=  6.      n 


2  j  e2 


/(6*7l(AllA'2)<   — ]  ^G„(A!i,Ar3) 


If!* 

e2   2 
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In  light  of  Lemma  2.3.1,  it  suffices  to  show  Gn(X\,  X2)  — *  0.  With  this  in  mind,  define 


<>J 


where  the  last  equality  follows  since  d(i,i)  =  0.  This  means  that  Gn{Xi,X2)  = 
EH  [S  ],  so  that  in  order  to  write  out  the  expression  for  G'n(A'1,A2)  we  need  only 
find  the  second  moment  of  S.  First,  we  will  need  some  preliminary  results  regard- 
ing the  moments  of  c(/?,,  Rj).  Let  Pn  represent  the  collection  of  all  permutations 
of  {1, . . .  ,n},  nCk  the  collection  of  all  subsets  of  size  k  from  {1, . . .  ,  n},  and  nD^  the 
collection  of  all  subsets  of  size  k  and  their  permutations  from  {1, . . .  ,n}. 


17 


Clearly,  for  i  ^  j, 


VHo[c(Ri,Rj)]=     E     c(a.-Iai)P[K  =  a] 
a  e  Pn 

E     c(/31,^2)P[i?i  =  /51,JRj  =  /52] 
E     <<ft,A) 


n(n-1}/3enD2 
=  c, 

and  in  a  completely  analogous  way  we  have  for  i   ^  j  and  i'  ^  j'  that  if  i'  =  i 
and  j'  =  j,  then 

1 


^"V^2 


{c(^,/?2)-c} 


C\ 


n(n  —  1)' 


and  if  i'  =  z  or  j'  =  j  (but  not  both),  then 
CovHo  [c{Ri,  Rj)c(Ri,,  Rj.)]  =  CovHo  [c(Rh  Rj)c(Ri,  R3,)) 

1 


n{n  ~  l){n  ~  2)  f3e  nD 

C2 

n(n  —  l)(n  —  2) 


E      {c{Pufa)-c}{c{f)ufo)-c} 


and  if  neither  i'  =  ?'  nor  j'  =  j,  then 
CovHo  [cC^.^-Hfti,^)] 

1 


n(n-l)(n-2)(n-3)(3enD4 


E    {c(A,A)-c}{c(Ai.A)-e} 


c3 


n(n-  \){n  -  2)(n  -  3) ' 
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Let  d,  d\,  d2,  and  ^3  be  the  analogous  quantities  in  d(i,j).  Now  using  these  expressions 
for  OovHo  [c(/fc,  Rj)c{I?„  /?;)],  we  see  that 


VWo  [5]  =  VWo 


J^dihMRiiRj) 


Cl 


n(n  —  1) 


+ 


E   AA,A)  + 


4c2 


/3e„£>2 
c3 


n(n-  l)(n-2) 


E     d{ftu(h)d(^(h) 

/36„£>3 


n(n  -  l)(n  -  2)(n  -  3)  flgj} 


E     d(fa,(12)d((]3,/14), 


and  since 


V, 


Wo 


Y,d(i,J)<Ri,Rj) 


V, 


Wc 


we  have  that 


V„0[S]  =    -^ 


4cW 


2  "2 


+ 


c^d3 


n 


(n-1)       n(n-l)(n-2)       ra(n  -  l)(n  -  2)(n  -  3) 


Note  also  that  since  H"^j{c(z,jf)  —  c}  =  0, 


E{c(i,j)-c}j    =ci+4c2  +  c3  =  0, 


v«'*7 


so  that  C3  =  — (ci  +  4c-2),  where  a  similar  result  holds  for  the  <fs.  Finally, 


Ti 


71  (n  —  1  )cd. 
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Since  EHa  [S2]  =  VWo  [S]  +  E2Ho  [S],  Gn{,\'uX2)  can  be  expressed  as 


Gn{XuX2) 


.2„2 


tr 


Cjd] 


+ 


4e2d2 


n{n  —  1)       n(n  —  l)(rc  —  2) 


(Cl  +4c2)(^  +4d2) 
n(n-  l)(n  -  2)(n  -3) 


+  (/?(??  -  i)cctj' 


To  show  G'n(A'1,  A'2)  -^  0  (and  hence  that  B\n  — >  0),  it  suffices  to  show  c  =  op(?i  x/2), 
c,  =  0(u2),  c2  =  op(n5/2),  d  =  op(n_1/'2),  di  =  op(n2),  and  d2  =  oP(«5/2)-  As  stated, 
these  results,  along  with  the  requisite  assumptions  needed,  are  in  Appendix  A. 

Recalling  that  Qn{0\,  O2)  —  Qn(&i,  02)  =  Bln  +  B2n  (see  (2.3)),  we  must  now  show 
B2n  -^  0  to  complete  the  argument  that  <2„(0i,02)  -  Qn(Ox,d2)  +  op(l).  Of  course 
it  suffices  to  show  EHq  [B2n]  — >  0.  Since 

11 

B2n  =  -^-Y^cos(irp2(ij;02)){cos(Trp1(ij;e1))  -cos(7rPi(^i;^i))}) 


we  have 


EH„  [BL 


r2    2        n 

'  l'2 


7r 


Z)  E«o  [cos(7rP2(J,j;^2))cos(7rp2(?',,i';^2))] 


J.Jil  -,3 


xE 


Ho 


{cos(7rp1(z,j;^1))  -  cos(7rpi(z,  j;  0i))} 


X 


{cos(7T/)1(z',j';^1))-cos(7rp1(Z',j';0]))} 


2r\r\n(n  —  1) 


E 


71" 


Ho 


cos2(7rp2(i,j;02)) 


x  E 


Ho 


{cos(7rpi(z,j;i?i))  -  cos('rrpl{i,j;02))} 


where  the  last  statement  follows  from  logic  similar  to  that  used  in  Theorem  2.2.2. 
Then  using  the  fact  that  the  integrand  in  the  second  expectation  is  bounded,  applying 
Lemma  A. 0.6  and  the  Lebesgue  Dominated  Convergence  Theorem  yields  the  result. 


CHAPTER   3 
PITMAN  ASYMPTOTIC  RELATIVE  EFFICIENCIES 


3.1      Introduction 

To  compute  Pitman  ARE's,  a  model  of  dependence  must  be  adopted  to  serve  as  an 
alternative  to  the  null  hypothesis  of  independence.  Konijn  (1954,  p.  300)  states  that 
"the  crucial  point  is  the  specification  of  a  class  of  alternatives  which  is  (i)  sufficiently 
wide  to  include  some  approximation  to  any  situation  that  may  arise  in  this  class 
of  problems,  and  (ii)  manageable  mathematically."  For  tests  involving  a  change  in 
location  of  a  distribution,  shift  alternatives  form  a  satisfactory  idealization  to  a  wide 
class  of  problems  and  are  quite  amenable  to  mathematical  analysis.  But  because  of  the 
innumerable  ways  dependence  can  manifest  itself,  our  situation  cannot  be  expected  to 
lend  itself  as  easily  to  so  simple  a  model.  This  is  not  necessarily  a  fatal  blow,  however, 
since  when  considering  a  model  in  the  context  of  local  alternatives,  there  is  reason  to 
believe  the  specific  form  of  the  model  is  of  little  consequence.  Witness  the  agreement 
in  the  Pitman  ARE's  reported  in  Section  1.1  for  the  bivariate  case  when  several 
different  models  were  used.  Thus,  although  we  propose  a  model  that  is  intuitively 
appealing  in  some  aspects,  our  main  reason  for  choosing  it  is  for  its  mathematical 
tractability.  We  require  that  the  model  be  a  function  of  a  nonnegative  real-valued 
parameter  A  such  that  as  A  — >  0,  the  sequence  of  alternatives  defined  by  this  model 
will  converge  to  the  null  hypothesis.  In  fact,  it  is  necessary  that  the  convergence 
of  this  sequence  of  alternatives  occurs  at  such  a  rate  so  that  it  is  contiguous  to  the 
null  hypothesis.  This  necessity  is  two-fold.  In  doing  calculations  under  the  sequence 
of  alternatives,  contiguity  allows  us  to  use  an  approximating  quantity  in  finding  the 
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limiting  distribution  of  statistics  of  interest  and  also  aids  in  determining  the  form  of 
that  limiting  distribution. 

3.2     Model  1 


A  generalization  of  the  model  apparently  first  studied  by  Konijn  (1954)  is  given 


by 


X 


y('2) 


(l  -  A)y(1)  +  AM,y(2) 

AM2Y(,)  +  (1  -A)Y{2) 


(l-A)/r,         AM, 
AM2        (l-A)/r2 


AAY, 


(3.1) 


where  Y^'  and  Y^2'  are  independent  random  vectors  that  are  ?i  x  1  and  ?"2  x  1,  respec- 
tively, M\  and  M2  are  arbitrary  (known)  matrices  of  dimensions  ri  x  r2  and  r2  *  ri, 
respectively,  and  0  <  A  <  1/2.  Notice  that  for  rx  =  r2  =  r  and  Afj  =  M~2  =  It,  A  = 
1/2  implies  X^'  =  X"'  (perfect  correlation),  while  A  =  0  corresponds  to  the  null 
hypothesis  of  independence.  Thus  we  can  restate  the  testing  problem  as  Hq  :  A  =  0 
vs.     H\  :  0  <  A  <  1/2.    Since  Y   =  A~^ X  is  a  nonsingular  linear  transformation, 

the  density  function  of  X  can  be  expressed  as  fx  {x;  A)  =  abs  (|Aa|" ~x)  fy  (A~^x\, 
where  fy  (y)  =  /1  (y^M/2  (l/2')  is  the  density  function  of  Y .   We  assume  that  the 

distributions  of  both  Y*1'  and  Y*-2'  are  elliptically  symmetric  with  dispersion  param- 
eters Si  and  £2,  respectively,  and  are  centered  at  dx  and  62,  respectively.  In  other 
words,  fk  (x{h))  =  Ckgk  ((a;W  -  6k)'J:^{xw  -  0kj),  where  Sfc  is  a  positive-definite 
matrix  and  gk  ()  does  not  depend  on  6k  or  S^.. 
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Such  a  model  might  conceivably  arise  when  considering  a  battery  of  psychological 
or  psychophysical  tests  administered  to  a  group  of  subjects,  with  the  goal  of  classifying 
the  outcomes  relative  to  certain  independent  "factors."  Suppose  that  apparently  the 
outcomes  of  one  set  of  tests  are  practically  determined  by  one  factor,  and  the  outcomes 
of  another  set  of  tests  are  practically  determined  by  a  second  independent  factor.  In 
order  to  test  this  hypothesis,  Model  1  could  be  used,  since  the  alternative  might 
reasonably  be  that  all  the  outcomes  depend,  to  varying  degrees,  on  both  factors. 

3.2.1     Contiguity 

We  wish  to  show  that  the  sequence  of  alternatives  H\  :  An  =  ?j_1'2A0,  where 
Ao  >  0,  is  contiguous  to  the  null  hypothesis.  To  achieve  this  we  follow  the  rationale 
of  Hajek  and  Sidak  (1967,  pp.  201-214),  which  we  outline  here.  Let  L(a;;An)  = 
/x(a;;An)//jr(ic;0),  and  An  =  logrEU  L(Xt\  An)  =  E?^ogL(Xi;An).  LeCam's 
first  lemma  asserts  that  if  An  ~  AN(—a2/2,cr2),  then  the  densities  n™=i  fx  \xi\  An) 
are  contiguous  to  the  densities  Y\i=\  fx  (^jjO).  Another  way  of  expressing  this  is  to 
say  that  the  sequence  of  alternatives  An  is  contiguous  to  the  null  hypothesis  (A0  =  0). 
If  Wn  =  2£)™_1[L(X,-;  An)1/2  —  1],  LeCam's  second  lemma  states  that  contiguity  will 
follow  if,  under  Ho,  the  summands  log  L(Xi]  An)  are  uniformly  asymptotically  negli- 
gible (UAN)  and  Wn  ~  AN(—a2/4,  a2).  Because  the  summands  depend  on  n,  finding 
the  limiting  distribution  of  Wn  directly  is  quite  difficult  so  they  consider  the  first-order 
approximation  of  Wn.  This  approximation  is  expressed  as  Tn  =  An  J2?=i  L'(Xi;0), 
where  Z/(x;0)  =  j^L(x,  A)|a=o-  Hajek  and  Sidak  demonstrate  the  contiguity  for 
a  univariate  shift  alternative  (pp.  210-213)  and  a  univariate  scale  alternative  (pp. 
213-214)  by  showing  that  under  the  null  hypothesis,  Wn  =  Tn  —  cr2/4  +  op(l),  the 
UAN  condition  holds,  and  Tn  ~  AN(0,a2).  Randies  (1989)  has  extended  the  argu- 
ments to  show  contiguity  for  a  multivariate  shift  alternative.    Noting  that  Model  1 
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is  a  multivariate  extension  of  a  scale  alternative  leads  us  to  emulate  the  methods  of 
Randies  in  extending  the  proof  of  contiguity.  A  sketch  of  this  extension  is  included  in 
Appendix  B.  Since  we  will  need  the  form  of  Tn  in  determining  limiting  distributions 
under  Ani  we  prove  the  asymptotic  normality  of  Tn  presently.  Considering  previ- 
ous discussion  regarding  the  invariance  of  the  statistics  — nlog  V  and  Qn,  we  assume 
hereafter  that  0^  =  0  and  Sjt  =  ITk.  We  discuss  the  ramifications  of  this  assumption 
with  respect  to  —  n\ogSJ°  later.  First  we  need  to  find  the  expression  for  L'(x;Q). 

Lemma  3.2.1    For  Model  1  (given  in  (3.1)), 

L'(x;0)  =  2  (ajW'asW&fcW'ajW)  +  -) 

+  2  (xW'xWfaixW'xW)  +  -) 

-  2a:*1)'  (h(x^'x^)M,  +  <f>2(x^'x^)M'2)  xS2\ 
where  <f>k(t)  =  g'k{t)lgk(t). 
Proof  of  Lemma  3.2.1   Since 

L(a?;A)  =  abs(|AA|-1)-/    )    A    \ 

v  J       fy  {x) 

we  see  that 


Now 


abs  (IAaT1)  =  -abs  (IAaI"1)  tr  (-A^P)  , 


3A 


w 


here 
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so 


that 


Also 


so  that 


because 


d 


-,abs(L4A|-<) 


dA 


A  =  (J 


=  ahs(|A0|-1)tr(Al7,P) 

=  abs(|/ri+r2|-1)tr(/-I+r2P) 
=  tr(P)  =  r,  +  r2. 


d  fyJA^x)        /  Q  \Vfy(x) 

dA        fy(X)  -[dA^*)        fy(X) 

(x)   ' 


fv  {A-A'x) 


fv  (x) 


A=0 


=  (a?PA?x)'^ 

(A-'PA-'x)  -jlj-y 

(r>    pi-1    xVYJxM 


=  (Px) 


^fy(x) 


fy  (x) 


■Mi      /..    /  \XM 


=  2{xW  -  MixWyxWfaixW'xW) 

VA  (*(fc))  =      {k)  A  (*<*>'*<*>) 
fk(xW)  ft  (ajW'asW) " 


The  result  follows  immediately.  □ 
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Recall  that  we  can  represent  X(fc)  as  R^U^  when  X^  has  a  spherically  symmetric 
distribution  (see  Section  2.2).   Using  this  form  for  X^k'  gives 


rn  =  n-i/2£2A0  ((/^M^)2)  +  y 


i=i 


+  ((^»)^((^))^)  +  ^)-f/!1)/JR!c/!2)}, 


where 


r,  =  rVrW  {^((R^M,  +  U{R(?)2)M'2) 


Lemma  3.2.2  If  EHo  \{R^Y<f>l{{R^f)\    <   oo,  EWo  [(i2«)2$((i2(fc>)2)]    <   oo,  and 

EHo  [(R{k))2]  <  oo,  then  V„o  [Tn]  =  a2  <  oo  and  7nn  ~  /1/V(0,a2). 

Proof  of  Lemma  3.2.2  To  guarantee  that  a2  <  oo,  it  is  sufficient  for 


E 


Ho 


((RwyM(R(1))2)  +  y)  +  ((/?(2))2^((/?(2))2)  +  y)  -  trw'ia/w 


<  oo. 


w 


here 


i*  =  /?<1)/?(2)  (^((fiWfjM,  +  ^((^2»)2)m:2)  . 

Thus  a2  <  oo  if  EHo  [(^fc))Vl((^(fc))2)]  <  oo  and  EWo    (Uw> RU{2)) 


<  oo,  where 


the  second  expectation  is  easily  seen  to  be  finite  if  EH    \(R^)2(j)l((R^)2)    <  oo  and 


JH0 


(Z?W)2     <  oo     Appendix  B  shows  further  that  if  a2  <  oo,  then  EH  [T„]  =  0. 


Since  the  terms  are  iid,  an  application  of  the  central  limit  theorem  gives  the  result. 
□ 

Thus,  we  have  established  conditions  under  which  contiguity  holds.     In  the  next 
section  we  work  out  the  limiting  distributions  of  Q„,  —  n  log  V,  and  —  n  log  .9  °. 
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3.2.2      Limiting  Distributions  Under  A,, 


LeGam's  third  lemma  states  that  if,  under  Ho,  f^J  ~  AN((  -<?\li )'  ( "x  "^))^ 


then   5'„   ~   AN(fi\  +  cr12,0"i)   under  a  contiguous  sequence  of  alternatives.     Find- 


ing the  limiting  distributions  of  Qn,  — nlogK,  and   —  n  log  SJ°  under  the  the  con- 
tiguous sequence  of  alternatives  An  will  involve  showing  that  under  //0,    ( £"  )    ~ 

AN((  qJ,  (J1   °\2  J  J,  where  Sn  is  an  appropriately  defined  statistic.  Then,  under  An, 


•'2 

Sn  ~  AA^(cr12,cr^)  and  from  this  it  will  be  possible  to  determine  the  limiting  distri- 


bution of  the  statistic  of  interest.  We  begin  by  finding  the  asymptotic  distribution 
for  Qn.  We  assume  throughout  the  rest  of  the  chapter  that  the  moment  conditions 
in  Lemma  3.2.2  are  satisfied. 

Theorem  3.2.1    Under  A„ , 

Qn  -i  xt>r2  (-—  vec  (EHo  [jq)'vec  (EHo  [R]))  . 

Proof  of  Theorem  3.2.1  Let  a  =  (o^,  a^)'  be  an  arbitrary  pair  of  constants  not  both 
zero.  If  Sn  =  n-1/2E?=i  U\l)\y/r^B)U\2)  (cf.  Theorem  2.2.1),  then 


a 

'  T 


ri 


-'"  £>„{«,  ((ft;")V.,((fi!")!)  +  £)  + «,  ((/?!i,)VJ((fi!2,);)  +  £) 
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Since  EHo  [.$'„]  and  EWo  [Tn]  are  zero,  EWo  [a'(Sn,  Tn)']  =  0.  Also,  the  summands  are  iid 
where  the  three  terms  in  each  summand  are  uncorrelated  with  mean  zero  so  that, 
V„o  [a'{Sn,Tn)]  =  a'X0  [Sn]  +  a\VHa  [Tn]  +  2a1a2Cov//o  [Sn,Tn] 

=  a'fvec  (B)'  vec  (S)  +  a\a2 

+  2«,«2  ■  2AON/nrIEWo  [l7(5)'BZ/(2)Lr(1),BC/(2)]  . 


Further,  using  (2.2), 


EHo  [U^RU^U^BU^l  =  -EHo  [UWRB'UW 


1 


— vec(B)/vec(EWo[fi]), 


so  that  a2  =  vec  (B)' vec  (B),  a\  =  a2,  and  aV2  =  vec(B)'  (^^vec  (EHo  [H])).  The 

asymptotic  normality  of  a'(Sn,Tn)'  follows  by  applying  the  central  limit  theorem. 
Then  under  A,,, 


Sn  ~  4/V  (vec(B)'  f--^vec(EWo  [B])j  ,vec(B)' 


vec(B)     , 


with  Lemmas  2.2.1  and  2.2.2  giving 

'4A2 


Qn  ^  X™  (  ^VeC  (E"°  [R])' vec  (E"o  [R])j  ■ 


The  result  follows  by  noting  that  since  the  difference  between  Qn  and  Qn  converges  in 
probability  to  zero  under  the  null  hypothesis  (see  the  proof  of  Theorem  2.2.2),  by  def- 
inition, the  same  difference  will  converge  in  probability  to  zero  under  any  contiguous 
sequence  of  alternatives.  Thus  Qn  and  Qn  will  have  the  same  limiting  distribution 
under  A„.  □ 

We  next  find  the  asymptotic  distribution  of  —  n  log  K  under  An.    It  is  easy  to 
show,  using  (/-statistics,  that  if  EHa  [Xft]  <  oo,  then  —  ?i  log  V  — ►  xlir2  under  H0.  To 
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determine  the  limiting  distribution  under  An,  we  need  to  find  a  simple  approximating 
quantity.  Puri  and  Sen  (1971,  p.  364)  show  that 


!■]        r,        r2       r2 

-n  log  V  ~nY.Y.Y.Y.  PsthfPsJ'pS 


5=1    s'  =  \    (  =  1    t'  =  \ 


0P(n-X) 


where  p,  the  sample  correlation  matrix,  is  partitioned  into  p,,1  =  {/>S5<  }n  xr, ,  p22   = 

{ptt'  }t2xt2,  and  p12  =  {pst}r,xr2  =  p'n-    Since  pkk  =  ITk  +  op(l),  we  use  Slutsky's 
Theorem  to  achieve  the  following  simplification, 

-nlogV  =  nvec(p12)'(p~1  ®  p22)vec(Pi2)  +0p(n_1) 
=  nvec(p12)'vec(p12)  +  op(l). 

Then,  since  p12  =  7i_1  A12  and  rc-1  £)?=i  Xj     =  op(l),  we  use  Slutsky's  theorem  again 
to  get 

—  n  log  V  —-  n_1vec(A12)'vec(Ar2)  +  op(l) 

-l,»w  if  v(l)  *-(*)'  1    „«,„  (sT  y(l)  v(2)'X 


n     vec 


£XW  UecExWHW 


W=l  /  \:  =  l 


_1   ' 

=  n 


'  ±±  tr  (( X\» X  f )' Xf'xf  A +or(l) 


(3.2) 


t=i  j=i 


1=1  J=1 

Thus  we  have  a  convenient  approximating  quantity  for  — nlogK.  In  fact,  comparing 
the  approximations  for  Qn  in  (2.1)  and  —  ?! dog  V  in  (3.2)  reveals  the  underlying  simi- 
larity in  structure  between  these  two  statistics,  which  at  first  is  not  readily  apparent. 
To  find  the  limiting  distribution  of  —7?  log  V  under  An,  we  proceed  as  in  the  proof  of 
Theorem  3.2.1. 
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Theorem  3.2.2   Under  A„, 

-nlogV  ±  x2riT2  (^v-(E«o  [«(,,«(2)JR])'vec(E//o  {r^R^R}) 


Proof  of  Theorem  3.2.2   Let 

Sn  =  n 


1/2  Y.U^' {R(^ R^ B)Uf] . 


=  n 


n 

x'=i 


The  key  quantity  is  again 

CT12  =  CovHo  [Sn,Tn] 


=  2A0EWo 


uMruw'uWrwrivbuw} 


J 


,/  /2A0 


vec(B)'  (f£^vec(EWo  [R^R^R] 


\ 


\rxr2 
Applying  Lemma  2.2.1  and  then  Lemma  2.2.2  yields  the  result.  □ 

If  we  adopt  the  notations  sgn  (x)  =  (sgn  (xi) , . . .  ,sgn  (xn))'  and  x  =  (xi, . . .  ,xn)', 
then  use  of  the  score  function  Jo  defined  by  (1.2)  enables  us  to  represent  the  matrix  TV2 
described  in  Section  1.2.2  as 

T12  =  n"1  £>gn  (xf>  -  I<")  sgn  (X?  -  X«)'. 

Further,  using  Theorem  2.13  in  Randies  (1982)  on  the  components  of  T12,  we  are  free 

to  replace  the  sample  medians  by  the  population  medians  (which  we  have  assumed 

are  zero)  when  considering  the  asymptotic  distribution  of  Ty2.   Continuing,  we  now 

have 

T12  =  n-'  £sgn  (X,(,))  sgn  (xf2))'  +  op(l) 
t=i 


^Esgn^sgn^'  +  o^l). 


i=i 
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Combining  this  with  the  result  that  -n  log  SJ°  =  nvec  (Tr2)'  vec  (T12)  +  Op(n_1 )  (see 
Puri  &  Sen,  1971,  p.  359),  shows  that 

-  7)  log  SJ° 

=  n-vec  (£sgn  K')  «W>  W)#vec  (£  sgn  (tf  «)  sgn  (tfj2>)')  +  0p(l) 

«'=1  .7  =  1 

where  again  it  is  of  interest  to  note  how  this  approximation  compares  to  those  of  Qn 
in  (2.1)  and  —77  log  V  in  (3.2). 

On  a  cautionary  note,  because  of  the  noninvariance  of  —  7i\ogSJo,  all  subsequent 
derivations  apply  only  when  2*  is  in  reality  a  diagonal  matrix.  Recall  with  the  other 
statistics  there  was  no  loss  of  generality  because  of  their  invariance  under  Q.  This  is 
probably  not  a  serious  concern,  since  as  Randies  (1989)  has  noted  in  the  comparison  of 
his  (affine-invariant)  interdirection  sign  test  with  a  (nonaffine-invariant)  component- 
wise sign  test,  spherical  symmetry  is  favorable  to  the  component-wise  test.  He  further 
states  that  although  it  might  be  possible  to  improve  slightly  the  efficiency  of  the 
component-wise  test  over  certain  points  in  the  alternative,  the  result  is  a  drastic 
depreciation  of  its  efficiency  over  the  rest  of  the  alternative  space.  This  generally  is 
not  a  desirable  property  of  a  statistic.  The  same  logic  applies  in  the  present  situation. 
After  deriving  a  moment  needed  in  the  subsequent  theorem,  we  proceed  exactly  as  in 
Theorem  3.2.1. 

Lemma  3.2.3  If  U(k)  ~  Uniform(ftrJ,  then 

E[K/((fc)l1 


y/irl 
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Proof  of  Lemma  3.2.3  A  point  on  an  r^-dimensional  unit  sphere  can  be  uniquely  rep- 
resented (provided  r^  >  2)  by  7^  —  1  angles  7/,, .  .  .  ,  7/r/,_,  and  the  equations 

//,  =  sin  7;,  sm  7/2  .  .  .  sin  i]rk-2  sin  j]Tk :_, 
(y2v  =  sm  7/,  sm  7/2 . . .  sm  7/rfc_2  cos  7/rjk _i 
U^     =  sin  7/]  sin  772  .  .  .  sin  r)Tk_z  cos  i7rfc-2 


1)4     =  sin  771  sm  7/2 . . .  sin  f]Tk-A  cos  tyrfc-3 


C/rfe-2  =  Sill  7/i  Sill  7/2  COS  7/3 

t/rjt-1  =  sin  7/1  cos  t;2 

/yW  =  cos »/] . 

If  the  joint  density  of  the  angles  is  proportional  to  sinr*-27/i  sinr*~  7/2  ■  •  -sin7/rfc_2, 
where  0  <  rjl  <  tt,  7  =  l,...,rjfc  —  2  and  0  <  i]Tk-i  <  27r,  then  C/'*'  will  have  the 
uniform  distribution  on  the  unit  hypersphere  of  dimension  r^.  Clearly  the  angles  are 
independent,  with  rjTk-\  uniformly  distributed  on  (0,2tt)  and  the  other  angles  having 
power  sine  densities  on  (0, 7r).  For  instance,  the  marginal  density  for  7/1  is 


v-(^) 
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Thus, 


E  It/ 


rW 


=   /    |  cos  771 1  (7(771)^77! 

Jo 
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^m 


\       cos  771  sinrfc_2  771^771 

Jo 
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rfc  -  1 


Jo 


rk--2. 
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r* 


(r*-l)V5Fr 


rfc 


-  1 


rfc 


Since  If/j^l  =  1  when  rk  =  1,  the  formula  holds  for  r*  >  1.  By  symmetry,  the  result 
holds  for  Ef|t/f(fc)|l.  D 


Theorem  3.2.3  Under  An, 

(       4A2I     f^Y 


-nlog5JoAx; 


\ 


7r2r2 


°ri+v;V . /r2+iNvec  (E»-  [«])'  WC  (E«o  [«]) 


Proof  of  Theorem  3.2.3  Let  5n  =  n"1'2  E?=i  sgn  (u\1])'  Bsgn  (t/j2))  so  that  the  key 


quantity  is  again 

a -j 2  =  CovHo  [S„,Tn] 


=  2A0EHo 


t/O'iit/^sgn^'^Bsgn^2') 


=  vec(JB)'  (2A0EWo  [|^>|]  EHo  [|t/<2)|]  vec  (e„0  [R])) 


and  using  Lemma  3.2.3, 


=  vec(B)' 


<    '2A0rf^r/7'2 


7T 


— r-Vr -, r-rr~vec(EH   \R 


with  Lemma  2.2.1  and  Lemma  2.2.2  giving  the  result.  □ 


3.2.3     Comparison  of  Statistics 
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We  are  now  in  a  position  to  find  the  expressions  for  ARE(Q„,  —  nlog  V)  and 
ARE(Qn,  —  nlog  SJ°).  Since  each  of  the  statistics  have  limiting  noncentral  x\T2  ^s' 
tributions  under  An,  Hannan  (1956)  has  shown  that  the  Pitman  ARE  is  the  ratio  of 
the  noncentrality  parameters.  Referring  to  Theorem  3.2.1  and  Theorem  3.2.2  we  are 
able  to  report  that 

r1r2vec(EHo[il])/vec(EWo[JR]) 


ARE(Qn,-nlogV) 


vec(EHn  [R^R^R])' vec(EHo  [rWrWr]) 

4vec(</?1M1  +  (p2M'2)'  vec(ipiMi  +  <p2M2) 
r,r2vec(Af1  +  M'2)' vec(M ,  +  M'2) 


where 


and 


Vi 


JHo 


R^\eh\r^^{{R{1])2) 


<p*  =  eHo  [/?<'>]  eHo  [i?(2>M*(2))2)]  • 


Note  that  for  M,  =  M', 


2' 


ARE(Q„,-nlogV) 


(V3l  +  ^2? 

r\r2 


Also,  using  Theorem  3.2.1  and  Theorem  3.2.3  we  see  that 
ARE(Qn,-n\og$Jo) 


xarMI!  +  2WrH 


2 


2 


^(yf? 


(3.3) 
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where  it  is  of  interest  to  note  that  there  is  no  dependence  on  g<  and  g-i  or  the  form 
of  the  matrices  M \  and  M-2-  Of  course,  as  noted  earlier,  this  result  holds  only  when 
St  is  diagonal.  For  general  E&,  (i.e.,  elliptically  symmetric  distributions),  the  Pit- 
man ARE  will  depend  on  the  underlying  covariance  structure  of  the  X^'s  and  X'2''s. 

In  the  next  sections,  we  compute  ARE(Qn,  —  n  log  V)  for  various  choices  of  g\  and  gi. 
Because  the  formulas  for  the  Pitman  ARE's  are  quite  complex,  a  visual  aid  can 
help  reveal  some  of  their  structure.    Therefore,  we  provide  graphs  which  illustrate 
the  Pitman  ARE's  for  the  three  statistics  in  various  simplified  situations.   One  sim- 
plification is  that  identica.1  distributions  were  used  for  both  X*1'  and  X  ■'.    Recall 
that  the  Pitman  ARE  is  the  ratio  of  sample  sizes  needed  for  competing  tests  to 
maintain  the  same  limiting  power  and  size  when  converging  to  the  null  hypothe- 
sis. Thus,  if  Tn  and  Sn  are  two  competing  sequences  of  tests,  then  loosely  speaking, 
ARE(Tn,.?n)  >  (=,  <)1  implies  that  Tn  requires  fewer  (equal,  more)  observations  to 
maintain  about  the  same  power  as  Sn,  meaning  Tn  is  more  (equally  as,  less)  efficient. 
Since  normally  ARE(Tn,Sn)  is  not  necessarily  bounded,  we  have  chosen  to  plot  the 
Pitman  ARE's  for  the  dimensions  given  by  the  axes  labeled  7-1  and  r'2  using  the  trans- 
formation l/(l+ARE(.S'n,T7l)).  This  results  in  the  surface  lying  between  0  and  1  for 
any  possible  value  of  ARE(Sn,Tn).    Theoretically,  this  keeps  the  visual  comparison 
of  situations  where  the  competing  statistics  relative  performances  change  direction 
on  equal  footing.    The  surface  is  now  loosely  interpreted  as  the  ratio  of  the  sample 
size  required  for  Tn  to  the  sum  of  the  sample  sizes  required  for  both  Tn  and  Sn. 
Practically,  this  means  that  in  all  cases  Tn  is  doing  better  if  the  surface  is  below  1/2 
and  worse  if  it's  above  1/2.  For  example,  in  Figure  3.1,  Qn  is  in  the  second  position 
and  the  surface  is  below  1/2  (except  at  rl  =  r2  =  1  when  Qn  and  —  n\ogSJ°  are 
asymptotically  equivalent),  so  it  is  more  efficient  than  —  nlog.S'"7"  when  at  least  one 
of  7-l  or  r2  is  bigger  than   1   and  equally  as  efficient  in  the  bivariate  case.    Similar 
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graphs  of  the  Pitman  ARE  of  Qn  with  —  n  log  V  follow  the  sections  in  which  specific 
distributions  have  been  assumed,  allowing  for  the  evaluation  of  ip\  and  <£>2- 


Figure  3.1.   l/(l  +  ARE(-nlog  SJ°,  Qn)) 


Exponential  power  class 


A  convenient  elliptically  symmetric  class  of  distributions  is  the  exponential  power 
class.  The  exponential  parameter  v  allows  for  a  choice  of  distributions  with  varying 
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heaviness  in  their  tails.  This  will  enable  us  to  evaluate  ARE(Qn,  —  n  log  V)  when  the 
underlying  distributions  of  X^'  and  X*2'  have  either  heavy  or  light  tails.  When 
;/  =  1,  this  corresponds  to  the  multivariate  normal  distribution.  When  0  <  v  <  1,  the 
resulting  distribution  has  heavier  tails  than  the  normal  and  when  v  >  1,  the  resulting 
distribution  has  lighter  tails  than  the  normal.  In  fact,  as  u  — *  oo,  the  distribution 
becomes  uniform. 
Let 


gk{t)  =  exp  (-— )  , 


and 


^r 


ck 


Tk 


2vk 


2vk 


*TkY  {— 


VkJ  J 


Ck 


\^m 

"it 
=<*? 

vm. 

Then  X^'  has  the  exponential  power  distribution  located  at  the  origin  with  dispersion 
parameter  ITk  and  exponential  parameter  vk  (X^  '  ~  Exp(vk))-  The  density  function 
of  (R^)2  is  given  by: 

CfeW2  _./o   ,  /     r*- 


h„[t)  = 


fc/2-1 


Vk 


exp 


Ck 


(3.4) 


We  first  calculate  a  useful  moment. 


Lemma  3.2.4 


E 


//,-, 


(#(*))•]  = 


2t/fc      J  ja/2 


r(£ 


provided  r^  +  a  >  0. 
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Proof  of  Lemma  3.2.4  Let  T  have  the  density  h  given  by  (3.4).  Then 


m 


cfcw24r*+a)/2 


(rk  +  a)/{2vk)-\ 


exp(—t)dt 


V    2;/fc 


rk  +  a 

1vk     )    ,a/2 

ak    i 


\2n) 


so  long  as  rk  +  a  >  0.  D 


Theorem  3.2.4  If  X^  '    ~   Exp(z^),  with  v^   >   —  (rk  —  2)/4,   and  we  assume  that 
Mi  =  M'2,  then 

ARE(Qn,-n log  V;  1^,1/2 ) 
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?V"'2 


"l 
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(h\  V    2i/2    /     V         2i/! 


2/^2/      \2i/i 


+  J/2 


<M  V    2vx    )     V         2i/2 

27/1  /  \2i/2/       y 


Proof  of  Theorem  3.2.4  Since 


Vk 


Mt)  =  g'k(t)/gk(t)  =  --t^-\ 

Ck 
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v,  =  eHo  [rw]  eHo[r^M(R{1))2)} 


EWo  W2)\  EWo 


/2(>)/2(i)2(",-1) 


f'i 


-^E„„  [fl»]  E„„  [flW— i 

'n  +2i/,  -  l 


V\ 
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2,,2     /rfl/2 
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"2v\ 
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<2«/,-l)/2 
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.j.,/;rf^±iVf'  +  2l/'-1 


and  similarly, 


¥■2  =  —Vi 


A. 


1/2  r 


,  /r,  +  1\      /r2  +2i/2  -  1 


2i/2 


(ILV/ra 


.2^1/     \2i/2> 

The  moment  conditions  in  Lemma  3.2.2  are  easily  shown  to  be  satisfied  for  Vk  > 
—{rit  —  2)/4,  so  putting  these  values  of  ip\  and  <£>2  in  (3.3)  gives  the  result.  □ 

Note  that  the  condition  that  u^  >  — (r*  — 2)/4  is  really  a  restriction  only  when  r*  =  1, 
in  which  case  we  need  i/*  >  1/4. 

Corollary  3.2.1    If  V\  =  i/2  =  v  and  r-i  =  r2  =  r  then 

>+  1\T.  (r  +  2v  -  lx  -2 


ARE(Q„, -n  log  K;i/;r) 


2i/f 


2;/ 


2i/ 


rr2   - 


iv 


(3.5) 


It  is  interesting  to  compare  this  expression  with  the  Pitman  ARE  of  Randies'  interdi- 
rection  sign  statistic,  Vn,  and  Hotelling's  T2  from  the  multivariate  location  problem. 
It  turns  out  that 


ARE(Q„, -n  log  V;i/;r)  = 


r+1 
2i/ 


2/ J     V   2i/ 


^-ARE(K,r2;/.;r). 
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Inserting  specific  values  for  r  and  v  in  (3.5),  we  have  that 

l-2 
2vV  [ 

ARE(Qn,-nlogV>;i; 


ARE(Qn,-nlogl/;l;r)  = 


r2 


2T2 


r+  1 


•r2 


and 


ARE(Qn,-nlogV;0.5;r)  =  1, 

where  again  it  is  of  interest  to  note  that  ARE(Qn,  —  n  log  V;  l;r)  =  ARE(Vn,T2;  l;r)2. 
Using  the  facts  that  lim,,-^  r(a/u)/T(b/i>)  =  b/a  and  lim,,-^  u/T(b/i')  =  6,  we  also 
have  that 

ARE(Q?l,-nlogK;oo;r) 


r+  1 

Notice  that  for  7'  =  1  (the  bivariate  case), 

ARE(Q„,-nlogV;0.5;l)  =  1, 


ARE(Qn,-nlogV;l;l) 


7T" 


and 


ARE(Qn,-nlogK;oo;l) 


4' 


which  all  agree  with  the  values  in  Table  1.1  and  Table  1.2  (v  =  0.5  corresponds  to 
the  Laplace,  v  =  1  to  the  normal,  and  v  =  oo  to  the  uniform  distribution). 

As  indicated  earlier,  what  follows  are  several  graphs  depicting  the  nature  of  the 
Pitman  ARE  given  in  Theorem  3.2.4  for  various  values  of  ;/  =  V\  =  u2.    Clearly,  Qn 
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does  very  well  when  v  =  0.1  (Figure  3.2).  Of  course,  although  we  can  compute  and 
graph  the  Pitman  ARE  at  the  values  ?'i  =  1  and  r2  =  1,  the  expression  for  the  ARE 
is  not  valid,  so  the  graph  should  he  ignored  in  those  areas.  Qn  and  —  n  log  V  perform 
almost  equivalently  when  //  =  0.5  (Figure  3.3),  and  —  ?i  log  V  beats  Qn  when  v  =  1 
and  v  =  10  (Figures  3.4  and  3.5). 


Figure  3.2.  l/(  l  +  ARE(-nlog  V,  Qn\  u  =  0.1)) 
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Figure  3.3.  l/(l  +  ARE(-nlog  V,  Qn;  v  =  0.5)) 


Multivariate  ^-distribution  family 


The  multivariate  ^-distribution  indexed  by  its  degrees  of  freedom  df,  is  another 
convenient  elliptically  symmetric  class  of  distributions.  Again  we  can  evaluate  the 
expression  for  AREf^,  —  n  log  V)  when  the  underlying  distributions  of  X^1'  and  X^  ' 
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Figure  3.4.  l/(l+ARE(Qn,  -nlogV;  v  =  1)) 
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Figure  3.5.  1/(1+ARE(Q„,  -nlogV;  v  =  10)) 
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have  varying  tail  weights.  When  df  =  1,  this  corresponds  to  the  multivariate  Cauchy 

distribution  and  as  df  — ►  oo,  the  distribution  approaches  multivariate  normality. 

Let 

-(dfk+rk)/2 


9k(t)=     1  + 


( 
#* 


anc 


'<*A  +  rk 


Ck 


{*dfky-/*T 


dfk 


Then  X'  '  has  a  multivariate  ^-distribution  located  at  the  origin  with  dispersion 
parameter  lTk  and  degrees  of  freedom  dfk  (X(  '  ~  t(dfk)).  The  density  function 
of  (/f'fc')2  is  given  by: 


MO 


/    ,     \rfc/2-l     /  X  -W*+ffc)/2 


«#£    ^ 


v1  +  4f*, 


(3.6) 


where  B(-,-)  represents  the  beta  function.  Thus  (/?^)2  has  the  same  distribution 
as  d/fc(l  —  U)/LJ  where  {/  is  distributed  as  beta  with  parameters  7'<,./2  and  dfk/2. 
Before  calculating  the  Pitman  ARE,  we  derive  a  useful  moment. 


Lemma  3.2.5 
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provided  r*  +  2o  >  0  and  dfk  >  2(a  +  b). 
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Proof  of  Lemma  3.2.5   Let  T  have  the  density  h  given  in  (3.6).  Then 
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so  long  as  r^  +  2a  >  0  and  d/fc  >  2(a  +  6).  □ 


Theorem  3.2.5  If  X{k)  ~  t(dfk),  with  d/fc  >  4,  and  we  assume  that  Mx  =  M'2,  then 
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Proof  of  Theorem  3.2.5  Since 
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*,,  =  eWo  [rm]  eHo  [rmM(R{1))2)] 
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and  similarly, 
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The  moment  conditions  of  Lemma  3.2.2  are  easily  seen  to  be  satisfied  if  dfk  >  2,  but  as 
the  marginal  fourth  moments  must  exist  for  —  n  log  V  to  have  a  limiting  distribution, 
we  need  dfk  >  4.  Putting  these  expressions  for  <p\  and  (p2  in  (3.3)  gives  the  result.  □ 


Corollary  3.2.2  If  dft  =  df2  =  df  and  r\  =  r2  =  r  then 


ARE{Qn,-n\ogV;df;r) 
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(3.7) 


Interestingly,  the  expression  (3.7)  can  be  factored  into  the  Pitman  ARE  under  mul- 
tivariate normality  times  a  quantity  involving  only  df,  which  (obviously)  goes  to  one 
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as  df  — >  oo.  Thus,  for  example,  putting  df  =  5  yields 
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here  1024/8  Itt2  =  1.2809  or  putting  ?•  =  1  yields 


ARE(Q„,-»logV;#;l)  = 
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Again  we  include  some  graphs  of  the  Pitman  ARE  given  in  Theorem  3.2.5  for 
various  values  of  df  =  df1  =  df2.  We  could  calculate  the  Pitman  ARE  when  df  <  5, 
but  since  —  nlogV  is  guaranteed  to  have  a  limiting  distribution  only  when  df  >  5, 
the  value  would  be  meaningless.  Hence  we  consider  only  df  >  5.  Of  course,  we 
anticipate  that  Qn  is  vastly  superior  to  —  ?i  log  V  when  df  <  5,  but  we  do  not  have  a 
way  to  quantify  their  relative  performance  in  this  instance.  (We  do  include  df  =  1 
in  the  simulation  study,  which  will  demonstrate  if  our  intuitive  feeling  is  borne  out.) 

When  df  =  5  (Figure  3.6),  we  see  that  Qn  and  — n  log  V  perform  essentially  the  same. 
When  df  =  10  and  df  =  100  (Figures  3.7  and  3.8),  — nlogK  beats  Qn.  In  fact,  for 
df  =  10,  the  Pitman  ARE  is  already  only  1.057  times  that  of  the  multivariate  normal 
Pitman  ARE,  for  which  —  nlog  V  is  optimal. 
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0.75 


Figure  3.6.   l/(l  +  ARE(-nlog  V,  Qn;  df  =  5)) 
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Figure  3.7.  l/(l+ARE(Qn,  -nlog  V;  df  =  10)) 
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Figure  3.8.   l/(l+ARE(Qn,  -nlogV;  df  =  100)) 
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3.3     Model  2 
A  model  proposed  by  Pari  and  Sen  (1971)  is  given  by 
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x  = 


X(2) 


Y(2)  +  AZ(2> 


y(i)\  /Z(D 

y(2)     +A|Z(2) 


y  +  az, 


where  Y^\  Y*2'  and  Z  are  mutually  independent,  and  the  matrix  CovHo  Z*1',  Z'2' 
consists  entirely  of  nonzero  elements  (although  it  appears  that  it  is  sufficient  for  it  not 
to  be  the  zero  matrix).  They  state  that  such  a  model  may  prove  useful  in  analyzing 
group  tests  in  psychology.  For  example,  the  outcomes  of  two  reading  tests  and  two 
math  tests  can  be  described  by  a  (linear)  combination  of  individual  group  factors 
pertaining  to  the  reading  or  mathematical  abilities  and  common  factors  corresponding 
to  intelligence  or  comprehension.  In  general,  the  distribution  of  X  is  determined  by 
using  a  convolution  formula  since  it  is  a  sum  of  two  independent  random  vectors. 
However,  obtaining  a  closed  form  expression  for  the  density  function,  an  integral 
step  in  being  able  to  work  out  the  details  related  to  contiguity,  is  typically  non- 
trivial.  An  exception  is  when  Y'1',  Y^2'  and  Z  have  multivariate  normal  distributions. 
Thus,  if  Y(1)  ~  MVN(0,Jri),  Y(2)  ~  MVN(0,/r2),  and  Z  ~  MVN(0,S),  then 
X  ~MVN(0,/n+r2  + A2S). 

Since  the  determination  of  contiguity  and  computation  of  the  limiting  distribu- 
tions of  —  n\ogV,  Qn,  and  —  n\ogSJ°  under  this  sequence  of  multivariate  normal 
alternatives  is  virtually  identical  to  the  computation  under  alternatives  described  by 
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Model  1,  the  details  will  be  omitted.  The  end  result  is  that 

-nlogV  4  xvV2  (Aj>vec(E12)'vec(E12))  , 
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From  these  expressions,  we  observe  that  the  Pitman  ARE's  will  be  identical  to  those 
derived  from  Model  1  when  the  underlying  distribution  is  multivariate  normal.  This 
is  further  indication  that  the  actual  form  of  the  model  used  is  somewhat  irrelevant 
when  dealing  with  local  alternatives. 


CHAPTER   4 
MONTE  CARLO  STUDY 


4.1      Methods 


All  simulation  programs  were  written  in  the  C  programming  language.  Previously 
written  routines  acquired  from  various  sources  were  combined  with  original  source 
code  to  complete  the  main  procedure.  Outside  sources  included  a  large  archive  of  soft- 
ware maintained  by  AT&T  called  Netlib  <netlib@research.  att  .com>  and  another 
archive  of  statistically  related  software  maintained  by  Michael  Meyer  at  Carnegie 
Mellon  University  called  Statlib  <statlib@lib .  stat .  emu .  edu>.  Included  in  the  non- 
original  code  used  are  parts  of  the  following  libraries:  c/meschach — a  set  of  functions 
which  do  numerical  linear  algebra,  dense  and  sparse,  with  permutations,  error  han- 
dling and  input/output  by  David  E.  Stewart  <des@thrain .  anu  .  edu .  au>,  c/cephes — 
a  set  of  special  math  functions  and  IEEE  floating  point  arithmetic  by  Stephen  L. 
Moshier  <moshier@world.  std.com>  and  ranlib-c — a  set  of  random  variate  gen- 
erators translated  from  FORTRAN  by  Barry  Brown  <bwb@odin.mda.uth.tmc.edu>. 
Also  used  are  chisq.  c ,  f  .  c  and  z .  c,  which  are  functions  written  by  Gary  Perlman  to 
compute  probabilities  and  percentiles  of  the  chi-square,  F  and  normal  distributions 
and  Ll.f,  a  FORTRAN  routine  based  on  an  algorithm  by  Barrodale  and  Roberts 
(1974)  to  compute  the  least  absolute  value  solution  to  an  over-determined  system  of 
equations  (personally  translated  to  C).  The  final  program  was  compiled  using  gec 
(GNU  project  C  compiler  v2.4)  on  a  SPARC  10.  Several  programs  which  are  of 
interest  are  included  in  Appendix  C. 
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The  two  distribution  types  used  were  the  exponential  class  and  multivariate  /,  de- 
scribed earlier.  The  method  for  generating  observations  from  these  distributions  has 
three  parts  (see  Johnson,  1987).  First,  a  vector  of  iid  /V(0,  1)  random  variables  is 
generated.  Second,  the  vector  is  divided  by  its  Euclidean  norm,  which  results  in  a 
vector  uniformly  distributed  on  the  unit  hypersphere.  Third,  multiplication  by  a  pos- 
itive random  scalar  with  the  appropriate  distribution  (v4Gamma(r/(2i^.),  i)1/^"*.-) 


for  the  exponential  class  and  vdfklx%.  f°r  the  multivariate  t)  yields  the  desired 
multivariate  observation.  Model  1  (3.1)  was  then  used  to  generate  the  dependence 
structure.  For  ease  of  comparison,  we  restricted  the  study  to  cases  where  r\  =  r-2  =  r 
and  the  underlying  distribution  types  were  identical  for  each  set  of  variables.  Specif- 
ically, for  the  dimensions  ;■  =  1,2,  we  used  the  distributions  v  =  0.1,0.5,  1,  10  in  the 
exponential  class  and  df  =  1,5  in  the  multivariate  t.  For  r  =  3,  we  considered  only 
the  exponential  class  with  v  =  0.1,0.5,  1.  The  sample  size,  n,  was  kept  at  30  and 
the  number  of  repetitions  at  each  setting  was  2500  when  r  =  1,2.  Because  of  the  in- 
creased computing  time  needed  when  r  =  3,  the  number  of  repetitions  was  decreased 
to  1000.  To  gauge  the  precision  of  the  empirical  powers  computed,  we  provide  a  table 
summarizing  the  maximum  estimated  standard  errors  of  the  empirical  power  over 
various  levels  of  the  true  power. 

4.2     Statistics  Compared 

Of  course  —  n  log  V,  —n  log  .S'J°,  and  Qn  were  included  in  the  study,  but  numerous 
other  normal  theory  tests  not  explicitly  investigated  in  this  thesis  were  also  added  to 
judge  Qn  against.  In  particular,  the  standard  multivariate  tests  used  in  the  SAStm 
procedure  PROC  CANCORR  and  their  variants  were  considered.  These  tests  are 
all  based  on  the  sample  canonical  correlations  (/31?  • .  .,pn):  where  p\  >  •  •  •  >  p 
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Table  1.1.   Maximum  Estimated  Standard  Errors  for  Empirical  Power 


Power 

Repetitions 

1000 

2500 

0.00  -  0.05 
0.05  -  0.10 
0.10  -  0.25 
0.25  -  0.75 
0.75  0.90 
0.90  -  0.95 
0.95  -  1.00 

0.0069 
0.0095 
0.0137 
0.0158 
0.0137 
0.0095 
0.0069 

0  0044 
0.0060 
0.0087 
0.0300 
0.0087 
0.0060 
0.0044 

are  the  eigenvalues  of  S  =  A^  Ar2A^21  A'r2.  Several  of  the  tests  can  be  approxi- 
mated both  by  an  F  distribution  and  a  chi-square  distribution.  We  present  both 
forms  for  comparison.  First  is  — nlogV,  which  multiplied  by  the  Bartlett  correc- 
tion factor  1  —  (r!  +  r2  +  3)/(2ra)  (Box,  1949)  is  labeled  as  LX.  The  F  approxima- 
tion (a  different  transformation  of  V)  is  naturally  labeled  LF.  VF  and  VX  are  F 
and  chi-square  approximations,  respectively,  for  Pillai's  trace  YaLi  Pi-  Likewise  UF 
and  UX  are  F  and  chi-square  approximations,  respectively,  for  the  Lawley-Hotelling 
trace  YaLi  Pili}  ~  pi)-  The  last  normal  theory  test  is  RF,  an  upper  bound  on  an  F  ap- 
proximation to  Roy's  greatest  root  p\/{\  —  p\).  For  consistency,  we  label  —n\ogSJ° 
as  PS. 

The  Oja  median  (Oja,  1983)  was  used  in  Qn,  which  we  will  henceforth  call  Ql,  to 
estimate  the  nuisance  parameters  9\  and  9-2-  This  generalized  median,  which  is  the 
point  minimizing  the  sum  of  the  volumes  of  all  p-dimensional  simplexes  formed  from 
p  —  1  sample  points  and  itself,  is  equivariant  under  the  group  Q  and  asymptotically 
normal  with  rate  y/n  (Oja  h  Niinimaa,  1985).   When  p  =  1,  the  Oja  median  is  just 
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the  usual  univariate  median.  The  method  used  to  compute  it  is  based  on  the  Lj-norm 
formulation  of  the  minimization  as  given  in  Niinimaa  (1992). 
The  final  statistic,  Q2,  is  defined  as 

1  =  1  .7  =  1 


where 


U 


and 


ft*)  =         s.;21/2(x'2'  -  e2) 

yJ{XM  -~92y±n{XM  -02) 

and  was  included  because  it  is  asymptotically  equivalent  to  Ql,  but  is  much  simpler 
computationally.  Robust  M-estimates  of  Sn  and  £22  as  described  by  Randies,  Brof- 
fitt,  Ramberg,  and  Hogg  (1978)  were  used  in  Q2  as  well  as  the  same  Oja  median 
statistics  used  in  Ql  to  estimate  6\  and  02-  Thus  Q2  is  invariant  under  Q  and  it  is 
hoped  that  it  will  be  as  robust  as  Ql. 

4.3     Results 

The  outcome  of  the  Monte  Carlo  study  is  presented  graphically  in  a  series  of  figures 
at  the  end  of  the  chapter.  However,  the  values  used  in  these  figures  are  in  Appendix  C 
in  tabular  form  as  well.  In  order  to  facilitate  understanding  of  the  simulation  results, 
we  make  some  general  comments  for  specific  cases. 

When  r  =  1  (the  bivariate  case)  the  results  are  not  unexpected.  Since  Ql  and  Q2 
are  essentially  equivalent  to  q'  (a  very  robust  statistic),  it  is  easy  to  understand 
why,  for  the  heavy-tailed  distributions,  (see  Figures  4.1  and  4.5  for  graphs  of  u  = 
0.1   and   df   =    1)   both   Ql    and  Q2  do  better  than   the  normal  theory  tests.     It  is 
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important  to  keep  in  mind  that  by  saying  "better",  we  mean  that,  not  only  does  the 
test  have  higher  power  over  the  alternative,  but  that  the  test  has  at  least  come  close 
to  maintaining  the  designated  nominal  level  of  0.05.  For  tests  that  do  not  achieve  the 
latter  criterion,  it  is  difficult  to  compare  them  with  competing  procedures.  In  general 
however,  it  seems  prudent  to  be  biased  in  favor  of  procedures  which  maintain  the 
desired  level  versus  procedures  which  have  higher  power  but  do  not  maintain  the  the 
nominal  level.  Thus  although  there  are  instances  where  Ql  and  Q2  are  "beat"  by  other 
procedures  at  various  points  in  the  alternative,  for  the  most  part  Ql  and  Q2  are  much 
better  at  maintaining  the  designated  0.05  level,  indicating  their  favorability.  This  is 
the  case  here,  and  in  general,  for  heavy-tailed  distributions  like  the  multivariate  t  with 
df  =  1.  An  interesting  observation  is  that  although  PS  is  asymptotically  equivalent 
to  q' ',  it  doesn't  do  as  well  as  Ql  and  Q2  for  these  distributions.  In  fact,  it  is  uniformly 
worse  (conservative)  than  both  Ql  and  Q2.  One  reason  may  be  that  the  central  limit 
theorem  doesn't  work  quite  as  fast  on  the  log  of  a  sum  (like  PS)  as  it  does  on  a  strict 
sum  of  terms  (like  Ql  and  Q2).  We  note  that  in  this  case  the  normal  theory  tests 
perform  almost  identically,  with  the  exception  of  UX,  which  is  too  liberal.  Hence  for 
?•  =  1  we  do  not  differentiate  among  the  normal  tests,  except  to  exclude  UX.  As  the 
distributions  become  lighter  tailed  (see  Figures  4.2,  4.3,  4.4,  and  4.6  for  graphs  of 
v  =  0.5,  1,  10  and  df  —  5),  the  normal  tests  are  clearly  better  than  than  Ql,  Q2, 
and  PS. 

When  r  =  2,  we  examine  the  competitors  showings  in  greater  detail.  Since  RF 
and  UX  are  consistently  above  the  0.05  level  by  a  large  margin,  while  the  rest  of  the 
normal  theory  tests,  although  they  may  also  exceed  it,  are  less  liberal  and  perform 
comparably,  we  will  not  differentiate  among  the  normal  theory  tests  except  to  exclude 
RF  and  UX.  For  v  =  0.1  and  df  =  1,  (see  Figures  4.7  and  4.11),  both  Ql  and  Q2 
perform  well,  maintaining  the  0.05  significance  level  and  showing  a  steep  increase 
in  power.    PS  comes  close  to  doing  as  well,  but  overshoots  the  0.05  level  and  has 
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power  slightly  below  Ql  and  Q2.  None  of  the  normal  theory  tests  do  very  well.  In 
fart,  for  //  =  0.1,  they  all  have  uniformly  much  lower  power  than  Ql  and  Q2  while 
greatly  exceeding  the  0.05  nominal  level.  For  //  =  0.5  and  df  =  5  (see  Figures  4.8 
and  4.12),  Ql  and  Q2  seem  to  do  equally  as  well  as  the  normal  theory  tests,  with 
PS  again  exceeding  the  0.05  level  and  having  power  somewhat  less  than  the  others. 
For  v  =  1,  10  (see  Figures  4.9  and  4.10),  Ql  and  Q2  start  off  very  competitive,  but 
have  decreased  power  relative  to  the  normal  tests  as  the  dependency  is  increased.  PS 
again  is  uniformly  worse  than  the  others,  having  exceeded  the  0.05  level  and  having 
lower  power  over  the  alternative. 

When  ?•  =  3,  the  situation  remains  essentially  the  same.  Referring  to  Figures  4.13, 
4.14,  and  4.15,  RF  and  UX  are  uniformly  bad  with  Ql  and  Q2  doing  slightly  worse 
than  the  other  normal  tests  for  v  =  1  but  competitive  for  ;/  =  0.5  and  dominant  for 
7/  =  0.1. 

In  general,  it  appears  that  Ql  and  Q2  do  a  much  better  job  of  maintaining  their 
nominal  level  than  PS  or  any  of  the  normal  theory  tests.  Ql  and  Q2  are  consistently 
very  close  to  maintaining  the  0.05  nominal  level  where  the  others  vary  widely  either 
above  or  below.  This  implies  that  the  small  sample  applicability  of  Ql  or  Q2  is  very 
good.  We  might  add  that  most  of  the  simulation  results  presented  here  are  basically 
in  agreement  with  the  Pitman  ARE's  derived  in  Chapter  3.  For  example,  compare  the 
essentially  equivalent  performances  of  Ql  and  LX  when  v  =  0.5  with  the  associated 
Pitman  ARE  (see  Figure  3.3). 
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Figure  4.4.  r  =  1,  n  =  30,  v  =  10,  reps  =  2500 
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Figure  4.6.  r  =  1,  n  =  30,  df  =  5,  reps  =  2500 


65 


1.0  - 


0.9 


0.8  - 


0.7 


0.6  - 


CD 

o 

D. 

|      0.5 

E 

LU 


0.4  - 


0.3  - 


0.2  - 


0.1 


0.0  - 


0.0 


0.05 


0.10 


0.15 
Delta 


0.20 


0.25 


0.30 


Figure  4.7.  r  =  2,  n  =  30,  v  =  0.1,  reps  =  2500 
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Figure  4.8.  7-  =  2,  n  =  30,  v  =  0.5,  reps  =  2500 
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Figure  4.9.  r  =  2,  n  =  30,  v  =  1,  reps  =  2500 
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Figure  4.10.  r  =  2,  n  =  30,  ;/  =  10,  reps  =  2500 
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Figure  4.1 1.  r  =  2,  n  =  30,  df  =  1,  reps  =  2500 
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Figure  4.12.  r  =  2,  n  =  30,  df  =  5,  reps  =  2500 
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Figure  4.13.  r  =  3,  n  =  30,  u  =  0.1,  reps  =  1000 
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Figure  4.14.  r  =  3,  n  =  30,  v  =  0.5,  reps  =  1000 
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Figure  4.15.  r  =  3,  n  =  30,  v  =  1,  reps  =  1000 


CHAPTER    5 

APPLICATIONS 


5.1      Analysis  of  Newborn  Blood  Gas  Data 

The  State  of  Florida  mandates  that  measurements  be  taken  of  newborn  blood  at 
birth.  This  is  usually  done  from  the  umbilical  cord,  but  it  is  unclear  whether  practi- 
tioners routinely  get  umbilical  arterial  blood  or  umbilical  venous  blood.  It  is  further 
unclear  whether  these  values  are  related  to  a  simple  arterial  draw  from  the  abdomen 
that  can  be  done  one  hour  later  in  the  nursery.  Behnke,  Eyler,  Cordon,  Woods,  and 
Thomas  (1993)  investigate  the  use  of  birth  weight,  gestational  age,  Apgar  scores,  cord 
blood  gas  values,  and  first  arterial  blood  gas  values  as  diagnostic  criteria  for  perinatal 
asphyxia  and  subsequent  low  neurodevelopmental  outcome  in  very  low  birth  weight 
infants.  Their  data  consist  of  57  cases  for  which  good  blood  gas  information  exists 
at  all  three  sites — umbilical  venous  at  birth  (UV),  umbilical  arterial  at  birth  (UA), 
and  abdominal  arterial  one  hour  after  birth  (AA).  For  each  site  we  consider  three  of 
the  blood  gas  measurements  taken:  bicarbonate  (HCO3),  partial  pressure  of  carbon 
dioxide  (CO2),  and  partial  pressure  of  oxygen  (O2). 

We  are  interested  in  determining  if  there  is  any  relationship  between  the  UA  and 
UV  blood  draws  at  birth  and  also  if  either  of  these  are  related  to  the  AA  blood 
draw.  Intuitively,  since  the  UA  blood  originates  from  the  mother  and  the  UV  blood 
from  the  newborn,  we  expect  that  there  should  not  be  as  much  dependence  between 
UV  and  UA,  and  UV  and  AA  blood,  as  there  is  between  UA  and  AA  blood.  If 
these  conjectures  are  supported  by  the  data,  there  is  need  for  concern  with  respect  to 
the  methods  of  the  practitioners.   Practically  speaking,  the  goal  of  mandatory  blood 
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draws  on   newborn  babies  when  the  diagnostic  potential  of  the  measurements  may 
well  depend  on  the  technique  used  seems  a  flawed  idea  at  best. 

A  summary  of  the  tests  of  independence  is  in  Table  5.2.  Notice  that  all  the  normal 
theory  tests  detect  dependence  in  each  of  the  three  comparisons,  while  the  nonpara- 
metric  tests  only  indicate  dependence  between  the  UA  and  AA  measurements.  In 
fact,  there  is  strong  agreement  among  all  tests  that  there  is  dependence  between  the 
UA  and  AA  blood  gas  measurements.  A  visual  inspection  of  the  blood  gas  measure- 
ments however  (see  Figures  5.1,  5.2,  and  5.3  for  plots  created  by  xgobi — a  program 
which  allows  three-dimensional  rotation  of  the  data),  reveals  three  observations  in 
the  UA  determinations  which  are  quite  distant  from  the  remainder  of  the  data. 

After  deleting  these  observations  (28,  52,  and  55),  the  tests  were  re-run.  The 
results  are  fairly  telling.  The  normal  normal  theory  tests  involving  the  UV  blood 
gas  measurements  are  now  non-significant,  with  the  exception  of  RF.  Of  particular 
interest  is  how  drastically  the  p-values  of  these  tests  changed  with  the  deletion  of 
just  three  points  out  of  57.  This  would  seem  to  be  an  undesirable  property  of  any 
test.  Although  PS  managed  the  same  conclusions  as  Ql  and  Q2  (with  and  without 
the  outliers),  its  p-value  also  changed  significantly  making  it  suspect  also.  Only  Ql 
and  Q2  remained  largely  unaffected  by  the  removal  of  the  outliers. 

Although  this  data  is  useful  in  illustrating  Qn's  resistance  to  outliers,  it  is  deficient 
in  that  there  is  structure  present  which  is  ignored  by  all  these  statistics.  HCO3,  CO2, 
and  O2  are  repeated  in  each  set  of  variates,  but  because  of  the  invariance  of  the 
statistics  with  respect  to  the  labeling  of  the  variables  within  a  set  they  fail  to  take 
advantage  of  this  fact.  Thus,  there  are  probably  other  techniques  for  analyzing  this 
data  which  might  be  more  powerful.  The  next  section  illustrates  an  example  where 
there  is  no  such  natural  pairing. 
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Tab 


Newborn  Blood  Data 


Obs 

UV 

UA 

AA 

HC03 

C02 

o2 

I1('03 

co2 

o2 

HCO3 

co2 

o2 

1 

22.8 

37 

246 

22.4 

40 

34 

23.9 

40 

36 

2 

19.8 

28 

80 

16.4 

39 

28 

18.4 

36 

43 

3 

24.2 

45 

44 

22.0 

55 

9 

27.0 

53 

15 

4 

16.9 

18 

171 

23.3 

48 

21 

21.8 

39 

31 

5 

13.0 

19 

83 

22.3 

46 

9 

21.2 

37 

23 

6 

14.1 

26 

70 

22.4 

46 

21 

21.5 

40 

30 

7 

24.2 

48 

40 

19.3 

39 

29 

22.0 

42 

30 

8 

17.6 

39 

81 

21.8 

34 

26 

21.1 

29 

41 

9 

21.8 

68 

34 

15.6 

51 

33 

19.3 

44 

32 

10 

18.4 

99 

67 

22. 1 

43 

16 

20.3 

28 

36 

11 

16.4 

23 

110 

23.5 

50 

19 

22.8 

40 

25 

12 

19.8 

21 

63 

22.1 

47 

18 

23.2 

38 

27 

13 

22.8 

42 

73 

23.2 

48 

39 

23.0 

44 

52 

14 

17.2 

16 

365 

21.4 

42 

31 

22.8 

41 

37 

15 

13.2 

21 

280 

16.9 

43 

17 

19.0 

35 

27 

16 

23.9 

94 

66 

19.7 

42 

13 

19.0 

34 

26 

17 

20.6 

34 

169 

25.7 

54 

12 

22.3 

43 

23 

18 

22.5 

29 

207 

13.5 

27 

30 

22.5 

42 

39 

19 

15.1 

25 

66 

17.0 

62 

23 

20.6 

39 

64 

20 

16.8 

21 

434 

25.4 

47 

20 

22.9 

39 

51 

21 

24.8 

63 

45 

26.0 

58 

14 

25.0 

49 

23 

22 

23.0 

26 

49 

21.8 

40 

21 

22.9 

38 

29 

23 

19.5 

30 

74 

24.8 

50 

21 

24.4 

45 

25 

24 

20.9 

39 

39 

22.2 

42 

14 

20.0 

35 

20 

25 

19.6 

38 

48 

24.9 

45 

17 

23.3 

38 

29 

26 

22.1 

53 

48 

24.5 

42 

20 

23.2 

36 

31 

27 

17.0 

28 

85 

27.7 

70 

7 

24.8 

44 

30 

28 

16.8 

29 

47 

21.0 

111 

3 

20.3 

103 

6 

29 

19.6 

34 

194 

24.5 

57 

11 

26.4 

50 

27 

30 

17.0 

26 

283 

19.5 

44 

20 

18.6 

33 

28 

31 

18.2 

18 

64 

16.6 

56 

21 

21.2 

47 

31 

32 

22.4 

20 

271 

20.5 

38 

22 

21.3 

35 

32 

33 

27.0 

49 

36 

26.6 

50 

24 

25.5 

45 

34 

34 

21.5 

50 

56 

22.8 

51 

10 

20.6 

42 

24 

35 

19.7 

58 

35 

23.0 

46 

20 

22.4 

40 

38 

36 

23.1 

41 

225 

24.7 

45 

22 

22.2 

37 

27 

37 

22.2 

47 

60 

22.3 

58 

19 

20.7 

45 

27 

38 

18.8 

21 

173 

21.6 

51 

14 

25.3 

49 

25 

39 

11.4 

9 

87 

21.2 

46 

9 

26.2 

54 

14 

40 

21.1 

36 

131 

19.5 

37 

30 

19.3 

35 

28 

41 

23.6 

47 

198 

21.9 

45 

10 

21.8 

42 

15 
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Table  5.1:  — continued 


Obs 

UV 

UA 

AA 

HC03 

C02 

o2 

HCO3 

C02 

o2 

HCO3 

C02 

o2 

42 

18.8 

44 

46 

20.3 

57 

13 

21.0 

53 

19 

43 

12.3 

12 

300 

18.7 

45 

15 

20.2 

41 

15 

44 

13.9 

12 

270 

18.7 

45 

8 

22.8 

50 

22 

45 

21.9 

42 

194 

19.7 

58 

23 

19.6 

53 

26 

46 

16.6 

27 

27 

20.4 

42 

30 

19.6 

38 

70 

47 

19.4 

27 

70 

22.7 

42 

16 

22.4 

34 

23 

48 

21.3 

28 

38 

23.9 

49 

15 

23.9 

46 

21 

49 

20.8 

44 

299 

21.9 

43 

24 

18.7 

32 

40 

50 

21.4 

42 

163 

24.3 

51 

16 

23.3 

44 

25 

51 

19.6 

38 

199 

21.6 

41 

29 

21.3 

33 

40 

52 

26.2 

37 

135 

25.6 

23 

51 

25.6 

44 

56 

53 

16.1 

26 

40 

21.1 

35 

25 

20.2 

30 

35 

54 

19.5 

37 

194 

22.2 

56 

14 

19.4 

42 

26 

55 

11.1 

27 

42 

16.9 

115 

11 

17.2 

96 

18 

56 

15.2 

25 

94 

19.6 

46 

19 

18.6 

32 

28 

57 

22.2 

50 

76 

21.3 

38 

25 

22.0 

38 

28 

5.2     Analysis  of  Fitness  Club  Data 


The  section  of  SAS/STAT  User's  Guide,  Volume  1,  which  describes  the  SAS  pro- 
cedure PROC  CANCORR  has  an  example  that  uses  data  provided  by  Dr.  A.  C.  Lin- 
nerud,  North  Carolina  State  University,  in  which  three  physiological  variables  and 
three  exercise  variables  were  measured  on  twenty  middle-aged  men  in  a  fitness  club. 
It  demonstrates  how  PROC  CANCORR  can  be  used  to  determine  if  the  physiological 
variables  are  related  in  any  way  to  the  exercise  variables. 

Here  again  we  have  potential  outliers,  which  we  can  see  from  examining  Figures 
5.4  and  5.5.  A  summary  of  the  statistical  analysis  of  this  data  (with  and  without 
observations  10  and  14)  is  in  Table  5.4.  Interestingly,  the  p- values  for  the  normal 
tests  all  decrease  while  the  p-values  for  the  nonparametric  tests  all  increase  with  the 
removal  of  the  outliers. 
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Table  5.2.  Statistical  Analysis  of  Newborn  Blood  Data 


UV  vs.  UA 

UV  vs.  AA 

UA  vs 

.  AA 

Statistic 

Value 

P- value 

Value 

P-value 

Value 

P-value 

LF 

2.5649 

0.001)7 

2.1290 

0.0317 

35.6598 

0.0000 

VF 

2.3596 

0.0158 

2.0662 

0.0356 

22.0281 

0.0000 

UF 

2.7189 

0.0058 

2.1519 

0.0285 

45.9539 

0.0000 

RF 

7.8735 

0.0002 

5.2465 

0.0030 

122.2455 

0.0000 

PS 

12.8302 

0.1704 

7.0634 

0.6305 

70.6444 

0.0000 

Ql 

4.9852 

0.8356 

9.6629 

0.3785 

82.9273 

0.0000 

Q2 

5.2102 

0.8156 

8.1707 

0.5170 

82.5464 

0.0000 

after  the  outliers  are  removed 

LF 

1.1296 

0.3476 

1.0216 

0.4271 

14.7553 

0.0000 

VF 

1.1137 

0.3564 

0.9938 

0.4477 

12.8989 

0.0000 

UF 

1.1386 

0.3397 

1.0446 

0.4080 

14.5733 

0.0000 

RF 

3.2089 

0.0308 

3.2942 

0.0279 

29.3290 

0.0000 

PS 

6.4742 

0.6917 

4.9757 

0.8364 

59.2939 

0.0000 

Ql 

3.2630 

0.9530 

9.4586 

0.3961 

75.7127 

0.0000 

Q2 

3.8196 

0.9229 

7.4735 

0.5880 

75.4526 

0.0000 

5.3     Analysis  of  Cotton  Dust  Data 


Merchant  et  al.  (1975)  studied  the  effects  of  cotton  dust  exposure  on  human 
beings  by  measuring  several  respiratory  variables  and  several  blood-related  variables 
on  12  subjects  exposed  for  six  hours.  The  data  consist  of  changes  in  these  variables 
from  baseline.  It  may  be  of  medical  interest  to  determine  if  these  two  sets  of  variables 
are  independent  or  not.  Included  among  the  respiratory  variables  are  closing  capacity 
(CC),  vital  capacity  (VC),  and  total  lung  capacity  (TLC).  Two  blood-related  variables 
are  oxygen  (O2)  and  white  blood  count  (WBC). 
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Table  5.3 

.   Fitness  Club  Dat 

a 

Obs. 

Weight 

Waist 

Pulse 

Chinups 

Situps 

Jumps 

1 

191 

36 

50 

5 

162 

60 

2 

189 

37 

52 

2 

110 

60 

3 

193 

38 

58 

12 

101 

101 

4 

162 

35 

62 

12 

105 

37 

5 

189 

35 

46 

13 

155 

58 

6 

182 

36 

56 

4 

101 

42 

7 

211 

38 

56 

8 

101 

38 

8 

167 

34 

60 

6 

125 

40 

9 

176 

31 

74 

15 

200 

40 

10 

154 

33 

56 

17 

251 

250 

11 

169 

34 

50 

17 

120 

38 

12 

166 

33 

52 

13 

210 

115 

13 

154 

34 

64 

14 

215 

105 

14 

247 

46 

50 

1 

50 

50 

15 

193 

36 

46 

6 

70 

31 

16 

202 

37 

62 

12 

210 

120 

17 

176 

37 

54 

4 

60 

25 

18 

157 

32 

52 

11 

230 

80 

19 

156 

33 

54 

15 

225 

73 

20 

138 

33 

68 

2 

110 

43 

Table  5.4.  Statistical  Analysis  of  Fitness  Club  Data 


Outliers  In 

Outliers  Out 

Statistic 

Value 

P-value 

Value 

P-value 

LF 

2.0482 

0.0638 

2.4847 

0.0305 

VF 

1.5587 

0.1551 

1.5877 

0.1504 

UF 

2.4938 

0.0238 

3.4540 

0.0045 

RF 

9.1986 

0.0009 

13.5137 

0.0002 

PS 

NA 

NA 

15.4118 

0.0802 

Ql 

12.9838 

0.1633 

11.8387 

0.2226 

Q2 

13.4917 

0.1416 

12.5776 

0.1827 

•       A*. 
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Figure  5.1.  Umbilical  Venous  Blood  Gas  Measurements 
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Figure  5.2.  Umbilical  Arterial  Blood  Gas  Measurements 
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Figure  5.3.  Abdominal  Arterial  Blood  Gas  Measurements 
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Figure  5.4.   Physiological  Measurements 
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Figure  5.5.  Exercise  Measurements 
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Table 

3.5.  Cotton  Dust  Data 

Obs. 

CC 

VC 

TLC 

o2 

WBC 

1 

-4.3 

0.24 

-0.11 

-1.0 

6000 

2 

4.4 

-0.29 

-0.01 

-1.5 

-350 

3 

7.5 

0.10 

0.67 

1.0 

-250 

4 

-0.3 

0.13 

0.31 

6.5 

1675 

5 

-5.8 

0.02 

-0.75 

-3.0 

875 

6 

14.5 

0.48 

1.14 

-3.0 

-100 

7 

-1.9 

-0.05 

-0.22 

-15.5 

1075 

8 

17.3 

-0.62 

0.62 

-13.5 

1675 

9 

2.5 

-0.16 

0.12 

0.0 

1500 

10 

-5.6 

0.15 

-0.14 

-4.0 

2200 

11 

2.2 

0.25 

0.40 

-2.5 

650 

12 

5.5 

-0.42 

0.22 

-1.0 

3025 

Table  5.6.  Statistical  Analysis  of  Cotton  Dust  Data 


Statistic 

Value 

P-value 

LF 

0.8623 

0.5452 

VF 

0.9339 

0.4976 

UF 

0.7783 

0.6024 

RF 

1.6581 

0.2521 

PS 

2.4917 

0.8694 

Qi 

14.7390 

0.0224 

Q2 

13.4729 

0.0361 

CHAPTER   6 
CONCLUSION 


6.1      Discussion 


With  the  increasing  availability  of  very  fast  computers,  multivariate  and  nonpara- 
metric  procedures  which  would  have  been  impossible  for  the  average  statistician  to 
implement  several  years  ago  can  easily  be  utilized  today.  This  fact  is  responsible 
for  the  explosion  of  such  multivariate  and  nonparametric  methodologies  seen  cur- 
rently. The  goal  of  this  dissertation  has  been  to  research  one  such  methodology,  the 
interdirection  quadrant  statistic  (Q„),  in  testing  for  independence  between  two  sets 
of  variates.  The  basic  competitors  considered  were  Wilks'  likelihood  ratio  criterion 
(— nlog  V)  and  a  specific  member  of  a  class  of  statistics  invented  by  Puri  and  Sen 
(— 7i  log  ,S  °).  As  demonstrated  in  Chapter  3,  the  Pitman  ARE's  indicate  that  Qn 
does  quite  well  relative  to  —  nlog  V  for  heavy- tailed  distributions  and  is  competitive 
for  moderate-tailed  distributions.  The  statistic  Qn,  under  spherical  alternatives,  ap- 
pears to  be  uniformly  better  than  its  natural  nonparametric  competitor  —  n\ogSJ°. 
Simulation  results  concur  with  theoretical  findings  in  the  sense  that  the  empirical 
powers  of  the  competitors  are  ordered  in  the  same  way  as  the  Pitman  ARE's  indicate 
they  should  be. 

6.2     Further  Research 

Other  avenues  of  research  might  be  to  investigate  the  potential  of  the  interdirec- 
tion quadrant  statistic  (i)  for  describing  the  nature  of  the  association  between  two  sets 
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of  variates  instead  of  merely  using  it  as  a  test  of  independence,  or  (ii)  for  determining 
which  variables  in  a  set  may  or  may  not  be  contributing  to  an  association  between 
the  two  sets.  It  might  also  be  desirable  to  make  comparisons  with  other  members  of 
Puri  and  Sen's  class  of  statistics,  like  a  multivariate  analog  of  Spearman's  rho.  Com- 
puting issues  that  arose  during  the  simulation  study  also  lead  to  possible  research 
areas.  Since  the  time  to  compute  the  interdirections  (based  on  simple  looping  algo- 
rithms) is  on  the  order  of  n  ,  where  k  is  the  dimension,  for  even  moderate  sample 
sizes,  use  of  the  interdirections  becomes  impractical.  Possible  work-arounds  to  this 
limitation  might  be  (i)  to  find  a  suitable  approximating  statistic  (which  might  entail 
more  simulation  work),  (ii)  to  derive  a  faster  algorithm  for  computing  the  interdirec- 
tions, or  possibly  (iii)  to  estimate,  by  using  some  sampling  method  for  instance,  the 
interdirections.  Of  course,  since  the  interdirections  have  wider  applicability  than  the 
present  independence  testing  situation,  these  results  would  naturally  have  broader 
appeal  than  this  context.  The  statistic  Q2  defined  in  Chapter  4  and  included  in  the 
simulations  there,  is  an  example  of  a  preliminary  step  in  examining  the  feasibility 
of  (i).  It  appears  to  do  quite  well,  so  that  this  seems  to  be  a  promising  idea. 


APPENDIX   A 
CONVERGENCE  RESULTS 


A  recurring  theme  in  the  proofs  of  the  lemmas  to  follow  is  the  re-expression  of 
certain  terms  arising  from  Taylor  series  expansions  as  (/-statistics.  This  is  important 
in  order  to  apply  existing  (/-statistic  theory  to  these  quantities.  Specifically,  we 
will  have  occasion  to  use  Theorem  2.8  in  Randies  (1982),  which  we  restate  here. 
Randies  considers  random  variables  which  would  be  (/-statistics  were  it  not  for  the  fact 
that  they  contain  an  estimator.  Let  X\,...  ,Xn  denote  a  random  sample  from  some 
population.  Let  h(x\, . . . ,  xr;  7)  denote  a  symmetric  kernel  of  degree  7'  with  expected 
value  ^(7)  =   E\[h(X\, . . . ,  XT;*</)],  where  A  denotes  the  true  value  of  an  unknown 

parameter  7.     Let  Un('f)  =    ("J      YlaeA*  h(Xai, . . .  ,Xar;7),  where  A*  denotes  the 

collection  of  all  subsets  of  size  r  from  the  integers  {1, . . . ,  n}.  Consider  the  following 
conditions: 


Condition  1.     Suppose 

(A  -  A)  =  Op(n-^2). 

Condition  2.  Suppose  there  is  a  neighborhood  of  A,  call  it  A'(A),  and  a  constant  K\  > 
0,  such  that  if  7  6  A'(A)  and  D(j,d)  is  a  sphere  centered  at  7  with  radius  d 
satisfying  D(y,d)  C  A(A),  then 


sup 

yeD(7,d) 


h(Xi , . . . ,  XT;  7')  -  h(Xi , . . . ,  XT;  7) 


<  K,d. 


Condition  3.     Suppose  there  exists  an  M\  >  0  such  that 

\h(x1,...,xT; 7)  -  h{xu...,xT;\)\  <  M, 

88 


89 
for  every  [x] ,...  ,xr)  and  all  7  in  some  neighborhood  of  A. 

Theorem  A. 0.1    If  Conditions  1,  2,  and  3  are  satisfied,  then 

£/n(A)  -  p(A)  -  Un(X)  +  fi(X)  =  op{n-x'2). 

Theorem  A. 0.1  also  holds  in  the  multivariate  setting,  so  we  can  readily  apply  it  to 
the  present  situation.  Also,  since  all  the  kernels  we  will  work  with  are  bounded, 
and  (#!  —  0\)  =  (3p(n-1/2),  we  will  only  be  concerned  with  satisfying  Condition  2  in 
Theorem  A.0.1.  In  the  lemmas  that  follow,  the  sufficient  conditions  will  not  be  given 
in  the  statement  of  the  lemma  since  they  are  quite  technical  and  the  lemmas  are  not 
of  importance  in  and  of  themselves.  Rather,  the  relevant  conditions  will  be  noted  as 
they  are  needed.  We  now  prove  the  first  of  the  necessary  limiting  results. 

Lemma  A.0.1   c  =  op(n-1/2). 

Proof  of  Lemma  A.0.1   Define 

Ki(7i)=  (2)      £cos(Tft(»,j';7i)), 

where 

(i,j)i<x 

and  h,{X\l\x(jl)]X^,...,X£]ri_i;1,)  indicates  whether  (1)  or  not  (0)  X^  and 

Xj     are  on  opposite  sides  of  the  hyperplane  formed  by  X£\...,X^  _    and  yx. 

Since  c  =  V(0i),  our  immediate  goal  is  to  express  bounds  on  ^(7^  in  terms  of 
(/-statistics  so  that  we  can  apply  Theorem  A.0.1.  Using  a  Taylor  series  expansion 
of  cos  x  around  the  point  y,  (cosx  =  cosy  —  (x  —  y)  siny  —  (l/2)(x  —  y)2 cos  z  for  some 
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between  x  and  y),  we  have  the  following  representation: 
K>(7i)=  (")      $3cos(tPi(*,J;7i)) 

-7r(o)       H{/'i('^;7i)  -Pi(*J;7i)}sin(7rp1(i,.7;7l)) 


*<J 


jr2/nN-1    " 


(A.l) 


9      S{pi(*ii;7i)-pi(«ii;7i)}  «*(*?&)> 


'2V2,       t<J 


where  Pl(i,j;7i)  =  E„o  [MX*1',  Xf>;  X«  •  •  • ,  *£>  _,;  7l)  |  X^.X^]  and  pf.  is 
a  value  between  fii(i,j;  7l)  and  p\(i,j;  7,).  If  we  define 

Kn(7i)=  L)       ^cos(7rp1(i,j;71)), 
V2n(7i)  =  L)      S{pi(*,i;7i)  -PiCiiJJTOjsinCTpi^jjTr!)), 


anc 


v3n(7i)=  (.J    £{pi(*ii;7i)-pi(*ii;7i)}  > 

then  it  follows  from  (A.l)  that 

|K(7l)|  <  |Vi»(7i)l  +  t|^(7i)I  +  (ira/2)|%.(7i)|.  (A.2) 

Since  Randies  (1989)  has  shown  that  his  interdirection  sign  statistic  has  a  limiting 
chi-square  distribution  under  the  null  hypothesis  of  a  known  location  parameter, 

in  our  notation   this  result  is  equivalent  to  r\((n  —  l)Vjn(0i)  +  1)   — »   xl  ■     Thus, 

V\n{0\)  is  Op(n_1).  Applying  Theorem  A. 0.1  and  a  Taylor  series  expansion  of  Hvin(Q\) 
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around  0\,  we  get 

M$i)  =  M*i)  +  (Mvln(»i)  -  Pvln(«i))  +  Op(n_1/a) 
=  M«i)  +  (^-^i)'^vlB(«t)  +  Op(n-1/a) 
=  Opfa-1)*  O'^n-^^o^l)  +  op(ri-1/'2) 
=  Opln-1/'2), 

if  V/iyIn(7,)  exists  and  is  continuous  at  #1,  since  then  V^Vin(7i)|-y,=e,  =  0  (the  direc- 
tional derivative  at  6]  is  constant)  and  9\  =  di+op(l).  We  label  the  existence  and  con- 
tinuity condition  as  (Cl).  Condition  2  is  satisfied  for  the  kernel  cos(7rpi(aij    ,  x\   ;  7X)) 
if  it  is  satisfied  for  the  kernel  Pi(x\    ,x-2    ,1fi))  and  we  label  this  condition  as  (C2). 
With  a  little  work,  V2n(7i)  is  seen  to  be  a  multiple  of  a  ^/-statistic. 

vM  =  (ny{r~2y£    S    ^fUf;xW...,xW_i;7l) 


ft  c; 


-1 

^2n(7l)i 


2  ; 

where 

</2l^-i     ,A2    ,A3    ,  ...,An+1,7j 

=  {^(XWX«;XW1...IXW1;71)-P1(XS1\X«;71)} 

xsin^X^At'^i)), 
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and  g\  is  a  symmetrized  version  of  the  kernel  gi-    ^2?i($i)  is  at  least  a  first-order 
degenerate  (/-statistic,  which  is  seen  by  observing  that  (taking  Q\  =  0) 

EWo[ri(Xi,\...,X?+i)|X^ 

=  a,  •  0  +  a2  -0  =  0. 
The  latter  follows  from  the  fact  that 


E«„ 


{&1(Xp)IX?>;X,«...,Xg>+1)-Pi(xW,X«)} 


x  sm^Trpi^A]    ,  A2    )J  |  A,    ,A2 


{Elf0[MX{1)IX«;X?>I...lXg>+1)|X«Xa« 
xsmCirpjCX^.X^)) 

0  •  sin(7rpi(A"i    ,  X 2  '))  (by  definition) 

0, 


-ft(Xp>,X2W)} 
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and  since  the  X^''s  are  jointly  symmetric  about  the  origin, 

F       \n  (X{1)    X(1)-  X{1)  Y(1)    1  I  Y(,)  Y{])    1 


E 


Wo 


{fc1(^,x?>;xf>j...)xg)+1)-p1(xw,x?>)} 


-(1)      v(l) 


1) 


■(1) 


=  Eh0 


xSm(TrMX\>\X^))\X^  >,...,  XY>+1 


{hr(-X?\x?\x£\...,xUi)-M-X?\X?)} 


-(1)     vO 


(1) 


-(1) 


Etf0 


xsin(7rPl(-Xr,X^))l^r,---,^;;+i 


{(1-/,1(X51)1X.«1»;X.<1),...,X^1))-(1-Pr(^,),^1)))} 


"(1)     y(l) 


(1) 


-(1) 


-E//o 


xsin(7r(l-Pl(X^,XH))l^r,-.-,^;Vi 


{fc1(X«xW;XW...,JCj;>+1))-Pi(X«X?>)} 


-(i)    V(J) 


(i) 


-(i) 


xsin(7rPl(Xi",X^))|XV',---,^^i 


—  _EjHo  [SHai    ja2    iA3    )■  ■  -j^n+lJ  I  A3    j-'-j-An+lj  5 

which  implies  that  EHo  [g^X^X^;  X{3'\  . . .  ,X™+i)  I  X$\  ■  ■  ■  ,*S-i]  =  °-  Then 
(Lee,  1990,  p.  83)  implies  that  U-2n(9\)  =  0p(ra-1).  Applying  Theorem  A. 0.1  we  have 

cM0l)   =  ^2n(0l)  +  <>>~1/2) 

=  Op(n-1)  +  op(rr1/'2) 
=  o^n"1/'2), 

since  EHo  [g3{X^\X^)]xS),...,X^)+l-1y1)  |  XfU^]  =  0  regardless  of  the  value 

of  7j,  and  hence  EH  [^2n(7i)]  =  0  for  all  f1.  Thus  V-27i{0\)  =  op(??_1/'2).  Condition  2 

is  satisfied  if  it  holds  for  the  kernel  /*i(a:i    ,  se2 ;x^  ',...,  x*i+1,  7J,  which  we  label 
as  condition  (C3).  We  also  need  (C2)  to  hold  again. 
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It  turns  out  that  \/3„(71)  is  bounded  by  a  multiple  of  a  ^/-statistic  plus  a  positive 
constant. 

M7i) 


<c„ 


c 


>rA    72r,  -r 


r,  -  1 


-l  /       \  -l 
2r, 


(i) 


2r,  -  2\      /2r, 


fi  - 


/3e  C** 
%(7i)  +  (1  -  C.), 


£    53*W/---^£1;7i)  +  (i-c„) 


where 


„fy(l)     yU).   yO)  yO)     .   yO)  yU)  .  -,   \ 

=  {/Jl(x!1Uln;^,,..,x[;)1;71)-MX!1US,);.1)} 

with  Cn  =  ("~r17  )/("1'i)  an<^  9s  a  symmetrized  version  of  the  kernel  g3.  Some 
simple  algebra  reveals  that  \  —  Cn  =  0(n~x).  Also,  U^n{6x)  is  at  least  first-order 
degenerate  following  the  same  reasoning  as  for  U2n{0\)-  We  have  (taking  B\  =  0) 
that 


K 


Ho 


A(xP,...,x}Z)\x\ 


(ih  i  v0) 
i 


-„F       Lfv(')    yO).  yO)  yO)    .  yW  yOnyOl 

—  "l^Ho   p^l-A-i     ,^-2    )^3    )••  ■5-Ar1+l>-A-ri+2»--  •»-A2riJ  I  A1 

4-/,  F      LfyCI    y(').  yf1)  y(!'    .  y(J>  y(1hl3T(1)l 


=  ax  •  0  +  a2  ■  0  =  0, 
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which  follows  since 

■^HoplAl     iA2     >-<*-3     )  •  ■  •  i-^-ri+li  Ar,+2'  •  •  •  J -*■  2r,  J   I  A1      ) -*■  2     i-<*-3     (■■•)An+l 


=  Ef/0 


{k1(xS1\xf);Xf>,...)^1)-Pi(^1),Jf?))} 

x{/,1(X^,X21);X(;U...,x(V1)-p1(^1),X21))} 


v*1)   v*1'   v*1)  v'1) 

Al     i^2     1^-3     r--)Ari+l 


{fc1(xS1\x<,>;jri1>,...Ixg>+1)-Pi(xi1),x?))} 

*   \EWo    [Bi(a]      ,X2     ;  Xn+2,  •  •  •  ,  -X"2rj)    I   ■X'l      ) -^2 


-Mx[l\x^)} 

(by  definition) 


=  0. 


Again,  (Lee,  1990,  p.  83)  shows  that  U3n(0i)  is  Op(n   l).  Since  EH  [^3n(7i! 
all  7l3  applying  Theorem  A. 0.1  shows  that 

U3n(01)  =  U3n(01)  +  op(n-1'2) 

=  Op(n-1)  +  op(n"1/2) 

=  o^n-1/2), 


Ofor 


so 


that 


V3n(8i)  =  (1  +  OCn-1))^*-1'2)  +  0(0 
=  op(n-^2). 

No  further  conditions  than  those  already  stated  for  Vin{6\)  and  V2n(#i)  are  needed. 
Since  Vl7l(0i),  V2„(0i),  and  V3n(fli)  have  all  been  shown  to  be  op(n-1'2),  referring 
to  (A. 2)  leads  to  the  conclusion  that  Vn(6x)  =  op(n-1'2)  as  well.  □ 


Lemma  A .0.2  rf  =  op(n   1/2). 
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Proof  of  Lemma  A .0.2  After  some  algebra,  the  proof  is  quite  similar  to  the  proof  of 
Lemma  A.0.1,  (with  similar  assumptions)  so  it  is  omitted.  □ 

Lemma  A .0.3  c2  =  op(n5/2). 

Proof  of  Lemma  A .0.3   In  light  of  the  limiting  behaviour  of  c,  we  need  only  work  with 

E     cUhJhWhJh). 

Define 

^n(7i)=(")  E     cos(7rp1(/i1,/i2;71))cos(7rp1(/91,/i3;7l)). 

W      PeDl 

Again  using  a  Taylor  series  expansion  of  cos  x  and  previous  limiting  results,  it  suffices 
to  show  that  each  of  W\n(0\),  M/2„(^i),  and  M^n(^1)  is  op(n-1'2),  where 

Win(7i)=L)         E     cos(7rPi(A^2;7i))cos(7ru1(/i1,/53;7,)), 

^n(7i)=(")      E  {A(A,A;7i)-pi(A,A;7i)} 

x  sin(7rpi(/i1,/3.2;71))cos(7rp1(/31,/33;71)), 


and 


-l 


w/3„(71)=    ,        E  {A(A,A;7i)-pi(A,A;7i)} 

x{p,(/?i,A;7i)-Pi(i9i,Aj;7i)} 

x  sin(7rp1(/i1,/i2;  7, ))  sin(7r;j, (/?, ,  /33;  7, )). 

Analysis  similar  to  that  in  the  proof  of  Lemma  A.0.1  using  the  repeated  application  of 
Theorem  A.0.1  (and  with  no  further  assumptions  than  those  needed  in  Lemma  A.0.1) 
proves  the  result.  □ 
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Lemma  A .0.4  d2  =  op(n5/2). 

Proof  of  Lemma  A. 0.4   After  some  algebra,   the  proof  is  again  quite  similar  to  the 
proof  of  Lemma  A. 0.3,  so  it  also  is  omitted.  □ 

Lemma  A .0.5  cx  =  0(n2). 

Proof  of  Lemma  A.0.5  It  suffices  to  note  that  c(i,j)  and  c  are  both  bounded.  □ 

In  order  to  simplify  the  proof  of  dx  =  op(n2),  and  because  it  will  be  of  general  use, 
we  prove  the  following  lemma. 

Lemma  A.0.6  Pk(i,j;Ok)  =  Pk{i,j]0k)  +  op(l). 


Proof  of  Lemma  A. 0.6  Let  e  >  0,  6  >  0,  and  Condition  2  be  satisfied  with  h  replaced 
by  hk(x\    ,  x\    ;  xy, . . . ,  *rfc+1;7fc)-  Then  we  can  choose  a  sphere  D(dk,d*)  so  that 


B  =  EH, 


sup 

T*6D(flfc,d*) 


-(*)    v(fc).  v(fc) 


(fe) 


hk(X\">,X)T;XZ\...,X).:>+1;lk) 


<  e/2. 


Now, 


P    WM'i^)-P*j;^-)|>e 


<p 


sup    |pfc(»,j;^fc  +  t)-pjfc(i,j;^fc)l  >  e 

itlKrf* 


+  P  \\ok-ok\\>d* 


which  for  n  sufficiently  large, 


<P 


sup   \ph{i,j;Ok  +  t)  -  Pk(i,j;Ok)\  >  t 
ItlKd* 


*/S 


■4-£/9 


P  [f/B-  >  e]  +  */2, 
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where  U*  is  almost  a  f /-statistic  (X,      and  X2     are  common  to  all  terms)  which 
estimates  B.  Using  (Serfling,  1980,  Theorem  A,  p.  201)  yields 

?[U:  -B>  e/2]  =  EHo  [EHo  [l(ir  -B>  e/2)  |  x\k\xf]]\ 
=  EHo\?\u*n-B>e/2\X<ik\xW 


<EHo[exp(-(l/2)Ln/(r2-l)J(e/2)2) 
=  exp(-K(r2-l)J(e2/8)) 


which  for  n  sufficiently  large, 


<  6/2. 

Thus  for  n  sufficiently  large, 
P  [\Pk(i,j;0k)-Pk(i,r,ek)\  >  e]  <V[U*n  >  e]  +  6/2 

<P[U*-B>  e/2]  +  8/2         (since  e/2  +  B  <  c) 

<  6/2  +  6/2  =  6.        D 

Lemma  A .0.7  dx  =  op(n2). 

Proof  of  Lemma  A.0.7  Since  d  =  op(n~^2)  and  d(i,j)  <  2,  it  suffices  to  show  that 
E?,y  \d{i,j)\  =  op{n2).  This  will  follow  if  EHo  [\d(i,j)\]  -»0asn->oo  because  d(i,  j) 
is  bounded.  Now 

EWo  [\d{i,j)\]  =  EHfl  [|cos(7rp2(i,j;02))-cos(7rp2(2,j;02))| 


<  ttEHo  [|p2(*',j;02)  -i>2{i,3\Qi) 

->o, 

with  the  last  statement  following  from  Lemma  A. 0.6  and  the  fact  that  the  integrand 
is  bounded.  □ 
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We  now  sketch  an  argument  that  in  the  bivariate  case,  (CI),  (C2)  and  (C3)  are 

attainable  conditions  with  some  mild  assumptions  on  the  underlying  density  /.    Let 

D  and  D'  be  circles  with  radii  d  and  2d,  respectively,  centered  at  7,  and  contained 

within  the  circle  K.  Restrict  K  to  be  inside  the  unit  circle.  Let  A\(x,  D)  be  equal  to 

the  probability  that  a  random  variable  Z  with  absolutely  continuous  density  function 

/  falls  in  one  of  the  two  extended  triangular  regions  above  and  below  x  as  shown  in 

Figure  A.l.  Then  for  a  given  X\  and  x2, 

sup  \p{xux2;j)  -p{xux2]j')\  <  A1{xl,D)I(xl  £  £>')  +  A1(x2,  D)I(x2  &  D') 
7'eo 

+  I(xt  or  x2  e  D'), 

so  that  it  suffices  to  show  E„o  [A1(X,D)I(X  0  D')}  <  kxd  and  P(X  e  D')  <  h2d 
for  some  constants  fcj  and  k2  in  order  to  satisfy  (C2),  where  X  also  has  the  density 
/.  Assume  that  /  is  spherically  symmetric  so  that  f(x)  oc  g(r2),  where  r  =  ||sc||. 
Further  assume  that  g(r2)  is  non-increasing  in  r2  (unimodal)  with  g(0)  finite  (so  that 
/  is  bounded).  Then  A\(x,D)  is  maximized  when  7  is  on  the  line  through  x  and 
the  origin.  Referring  to  Figure  A. 2,  A\{x,D)  <  AA2(x,D),  where  A2(x,D)  is  the 
probability  that  Z  falls  in  the  extended  triangular  region  below  x.  Now  for  x  £  D', 
(i.e.,  ||sb-7||  >2d), 

a  =  sin^dlas  -  7||_1d)  <  (2/>/3)||as  -  fW^d. 

It  then  follows  using  some  basic  trigonometry  (see  Figure  A. 3)  that 

A2(x,  D)  <  (4/V3*)||a!  -  711"^+  (2r2/VS)\\x  -  jW^d. 

Thus,  we  need  E^  [||JST  —  -y||-1]  <  00  and  EHo  [R2\\X  -  7H"1]  <  00  (R  =  \\X\\). 
Both  of  these  moments  can  be  shown  to  be  finite  if  in  addition  to  the  assumptions  on 
/  we  add  EH  [R2]  <  00.  This  can  be  seen  as  follows.  Clearly  EH  [\\X  —  7||_1]  <  00 
if  and  only  if 


EH\\\X-1\ri(\\X-1\\<c) 


<  00 
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for  c  a  positive  number.    Because  of  the  symmetry  of  the  problem,  without  loss  of 
generality  take  7  =  (7,0)'  and  0  <  7  <  1/2.   For  c  =  1/2, 

Ello[\\x-1\ri(\\x-J\\<, 

=  //((.n-7)'2  +  .^)-,/2/(||X-7||<  L/2)/(x„ x2)dxxdx2 

<  /(0,0)  /  /((.r,  -  7)2  +  x|)_1/2/(0  <xx  <  1)7(0  <x2<  1)  dx^xa 

=  /(O.O)/1  /_'  ((x,  -  7)2  +  AY1'2  dx2dxx. 


This  last  integral  can  easily  be  verified  finite,  and  again  similar  reasoning  shows 
that  VHo[R2\\X  -7||-1]  <  00.  Further,  since  P(X  G  D')  <  (4tt/(0))V2,  we  have 
P(X  £  D')  <  k2d  since  d  <  1.  Similar  arguments  can  be  used  to  show  (C3)  holds  as 
well. 

To  show  (CI)  can  reasonably  be  attained,  we  utilize  results  pertaining  to  the 
continuity  and  differentiability  of  integrals,  for  example  (Randies  h  Wolfe,  1979, 
Theorems  A. 2. 3  and  A. 2. 4,  pp.  417-418).  First  observe  that 

/  h{xux2,z  -  7)/(z)  dz  =   I  h{xux2,z)f(z  +  7)  dz, 

where  h  is  the  indicator  of  whether  X\  and  x2  are  on  opposite  sides  of  the  line  formed 
by  z  —  7  and  the  origin.  This  integral  is  a  continuous  function  of  7  in  a  neighborhood 
of  the  origin,  since  we  can  bound  the  integrand  by  f(z),  with  /  f(z)dz  =  1  <  00. 
Further,  if  the  elements  of  V/(z  +  7)  can  be  bounded  by  an  integrable  function  not 
depending  on  7,  then 

V  /  h{xux2,z)f(z  +7)  dz  =  J  h(x1,x2,z)Vf(z  +  7)  dz 

will  exist  and  be  continuous.  Now 

/  /  cos  br  /  h(X]  -  -y,x2  -  7,z  -  ~f)f(z)  dz)  f(xi)f(x2)  dxxdx2  = 

J  J  cos  (tt     h(xi,x2,z)f(z  +  7)  dz)  f(xx  +  -f)f{x2  +  7)  dxxdx2, 


Figure  A.l.  Region  1 
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Figure  A. 2.  Region  2 
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Figure  A. 3.  Region  3 

where  the  integrand  is  a  continuous  function  of  7  is  the  mean  function  ^vin(l)-  By 
similar  reasoning,  V//vln(7)  is  given  by 

V  /  /  cos  In  J  h(xux2,z)f{z  +  7)  dz)  f(xj  +  ~f)f(x2  +  7)  dx1dx2  = 
/  /  sin  (tt     h(xux2,z)f(z  +7)  dz)  ■  [tt  j  h(xux2,z)Vf(z  +  7)  dz)   dxxdx2 
+  2  /  /  cos  U  I  h(xux2,z)f(z  +  7)  dz)  V/(a;1  +j)f(x2  +  7)  dxxdx2 

will  exist  and  be  continuous  if  we  can  bound  the  components  of /(z+7)  and  V/(z+7) 
by  integrable  functions  not  depending  on  7.  These  conditions  on  /  are  satisfied  in 
the  case  of  the  normal  distribution  among  others.  For  example,  letting  f(x,  y)  denote 
the  bivariate  standard  normal  density,  we  can  define  the  dominating  function  g  as 
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follows: 


g{x,y)  =  < 


/(0,0) 
/(0,y  +  l) 
/(0,y-l) 
/(a: +  1,0) 
/(x-1,0) 
/(x  +  l,y-l) 
/(x  +  l,y  +  l) 
/(x-l,y-l) 
/(x-l,y  +  l) 


-1  <  i  <  1 
-1  <y<  1 
-1  <  x  <  1 

—  oo  <  y  <  —  1 
-1  <  x  <  1 

1  <  y  <  oo 

—  oo  <  x  <  —  1 
-1  <y  <  1 

1  <  .r  <  oo 

— l  <y < l 

— oo  <  x  <  —  1 

1  <  y  <  oo 
— oo  <  x  <  — 1 

—  oo  <  y  <  —  1 
1  <  x  <  oo 

1  <  y  <  oo 
1  <  x  <  oo 
— oo  <  y  <  — 1 


Clearly,  this  function  dominates  /(x  +  7i,y  +  72)  for  ||*yj|  <  1  and  is  integrable  because 
the  tail  behavior  is  essentially  the  same  as  for  the  original  normal  random  variable. 
Because  V/(x  +  7)  =  —(a;  +  f)f{x  -f  7),  similar  reasoning  will  lead  to  an  integrable 
dominating  function.  Thus,  we  know  the  conditions  used  in  the  technical  lemmas  are 
achievable. 


APPENDIX   B 
CONTIGUITY 


We  need  to  establish  the  two  propositions 

•  Wn  =  Tn-a2/4  +  op(\) 

•  The  summands  L(Xt;  An)  are  UAN  (i.e.  L(X;;  An)  =  1  +op(l)) 

when  the  null  hypothesis  is  true  and  VWo  [Tn]  =  a2  <  oo.  Keep  in  mind  that  we  have 

A7l  =  n-1'2Ao,  where  Ao  is  a  positive  constant.  We  start  with  the  first  proposition. 
It  suffices  to  show  EHo  [Wn]  — »  — <r2/4  and  VHo  [Wn  —  Tn]  — »  0  as  n  — >  oo.  Let 
s(aj)  =  /^(as),  so  that  Vs(sc)  =  {Vf{x))/(2f1^(x)).  Then  we  can  express  EH(j  [Wn] 
as 


EHo[Wn]  =  2nEH\L1'2{X;An) 


2 


=  2nE 


//,, 


IAaJ-^^CA^X) 


=  -Kj 


dx. 


We  need  to  show  that  the  integral  converges  to  cr2/(4Ao).  Since 
\AAn\-V*s{A£x)  -  s(x)] 


lim 

A„^0 


A, 


(\/2)(rl+r2)s(x)  +  (Px)Vs(x), 
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and 


\A*n\-V*s(Al\x)  -  s(x) 


1     rA< 


-~  j   n(l/2)\At\-Vhr(A^P)s(x)  +  \At\-ll\A-'PA?x)'Vs{x)dt 

L\n  JO 

-J-  I"    \(l/2)\At\-^Hv(A;lP)s(x)  +  \Atr^2(A;lPA^x)'Vs(x)  ' 

Z_iTi   JO 


dt. 


we  nave 


/ 


\AAn\-^-2s(A-A[x)-s(x) 


-i  2 


A, 


dx 


<  -^J I  "  [(l/2)\At\-^HT(AT1P)s(x)  +  \At\-^2{A-1PA;1x)'Vs(x)]2dtdx 

~  a\C"  I  [^2)\A^1/2tT(^T1P)s(x)  +  \At\-1f2(AT1PAT1x)'Vs(x) 

=  J  [(l/2)(r1  +  r2)s(x)  +  (Px)'Vs(x)}2  dx, 


dx  dt 


so  that 


J™  EHo  [Wn]  =  -A2QJ[(l/2)(ri+r2)s(x)  +  {Px)'Vs{x)f  dx 

=  -^e„o[(l'(x;o))2 


We  need  to  show  EHo  [{L'(X;  0))2]  =  <r2/A2.  This  is  clear,  since 
EHo  [L'(X;  0)]  =  |(r,  +  r2)/(a;)  +  (Px)'Vf(x)dx 

\AA\-'f(A-A'x)-f(Xy 


=  /lim 

J    A-^0 


dx 


=  Um(l/ A)  J [\AA\-'  f(Al'x)  -  f(x)]dx 


lim(l/A)(l  -  1) 


0, 
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plies  that  EWo  [Tn]  =  0,  and  hence  that  a2  =  A^EWo  [(L'(X;0))2].  Now  since 
V//n  [Wn  -  Tn] 


=  4A^VWo 


<  4A2E„o 


4A2/ 


\AAn\-V2s(AllX±  _  j       r,  +  r2       (PX)'V,(i; 


An.s(X) 


AnS(X) 


MX) 


x    |    r1+r2    |    (PX)'Vs(X) 


s(X) 


\AAnr<HAl\x)  -sW  +  (r.+  ?)«(«)  +  ■ 


2 


<£c, 


^w0  f^n  ~~  ^"1  ""*  0  as  n  — >  oo.    To  show  the  second  proposition,  first  define  Z 
L'(X]0)  and 

\A*n\-*\f{A?nx)  -  f(x) 


Since 


EHq   [(Zn  -  Z)2]    =  I 


/(«)L 


da;. 


clearly  EHo  [(Zn  —  Z)2]  — >■  0  as  ?j  — >  oo,  thus  Zn  =  Op(l).    Then  L(X;  An)  =  1  + 
n-1/2A0Zn  =  1  +  0P(n"1/2)  =  1+  op(l)  as  needed. 


APPENDIX    C 

SIMULATION  STUDY 


CI      Source  Code 


Following  are  listings  of  some  of  the  programs  used  to  run  the  simulation  study. 
They  are  not  intended  to  be  complete  in  the  sense  that  they  can  stand  alone.  However, 
they  do  give  an  indication  of  the  logic  and  the  methods  used. 

C.l.l     interdir.c 

^include  <stdio.h> 
^include  <math.h> 
^include  "machine. h" 
^include  "general. h" 

/* 

*  Given  an  (n  x  p)  data  matrix  A,  InterdirectionCounts  will  compute 

*  the  interdirection  counts,  placing  them  in  the  upper  triangular 

*  (n  x  n)  matrix  COUNTS.    As  a  by  product,  the  return  value  is  the 

*  total  number  of  p-1  dimensional  hyperplanes  through  zero  that 

*  could  be  constructed  from  the  n  data  points. 

*/ 

int  InterdirectionCounts(MAT  *A,  MAT  *COUNTS) 

{ 

VEC  *DELTA; 

double  sum,  temp; 

int  *hyper,  *base,  *distinct,  *subs; 

int  pair [2]; 

int  first_hyper,  first_pair; 

int  n,  p,  nJiyper; 

int  i,  j; 

int  Perpendicular(MAT  *,  int  *,  VEC  *); 

double  OppositeSide(MAT  *,  int  *,  VEC  *); 


ins 
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it  =  A— >m; 
p  =  A-m; 

if  (P  >  1)  { 

DELTA  =  get_vec(p); 

hyper  =  (hit  *)malloc(p  *  sizeof(int)); 
base  =  (int  *)malloc(n  *  sizeof(int)); 
distinct  =  (int  *)malloc(n  *  sizeof(int)); 
subs  =  (int  *)malloc(n  *  sizeof(int)); 

A 

*  hyper[]  is  an  array  of  dimension  p-1  which  contains 

*  the  labels  of  the  points  (row  numbers  of  A)  in  the 

*  current  hyperplane. 

*/ 

nJiyper  =  0; 
firstJiyper  =  TRUE; 

while  (NextSubset(hyper,n,p—  1, firstJiyper))  { 
firstJiyper  =  FALSE; 

if  (Perpendicular(A,hyper,DELTA))  { 
++nJiyper; 

A 

*  distinctf]  is  an  array  of  dimension  n-(p-l)  which 

*  contains  the  labels  of  the  points  not  in  the  current 

*  hyperplane. 

*/ 

for  (i  =  1;  i  <  n;  ++i)  base[i-l]  =  TRUE; 
for  (i  =  1;  i  <  p-1;  ++i)  base[hyper[i-l]-l]  =  FALSE; 
for'(i=  l,j  =  l;i  <n;++i)  { 
if  (base[i-l])  { 
distinct[j  — 1]  =  i; 

} 
} 

A 

*  pair[]  is  an  array  of  dimension  2  which 

*  contains  2  of  the  labels  from  distinctf] 

*  This  loop  iterates  over  all  possible 


} 


*    pairs. 
*/ 

first_pair  =  TRUE; 

while  (NextSubset(subs,n— (p—  l),2,first_pair))  { 

first_pair  =  FALSE; 

pair[0]  =  distinct[subs[0]  — 1]; 

pair[l]  =  distinct[subs[l]  — 1]; 

temp  =  OppositeSide(A, pair, DELTA); 

sum  =  temp  +  m_entry(COUNTS,pair[0]-l,pair[l]-i; 

m  J5et(COUNTS,pair[0]- 1  ,pair[l]-l  ,sum); 

} 

} 

} 

freevec(  DELTA); 

free((void  *)hyper); 

free((void  *)base); 

free((void  *)distinct); 

free((void  *)subs); 

} 
else  { 

n -hyper  =  1; 

for  (i  =  1;  i  <  n  -  1;  ++i)  { 
for  (j  =  i  +  l;j  <  n;  ++j)  { 

temp  =  — (double)(sign(m_entry(A,i  — 1,0)) 

*sign(m_entry(A,j-l,0))  -  l)/2; 
m_set(COUNTS,i-l,j-l,temp); 
} 
} 
} 

return  nJiyper; 


A 

*  Computes  a  perpendicular  vector  (DELTA)  to  a  hyperplane 

*  defined  by  the  rows  of  A  indicated  by  the  row  labels  in 

*  hyperplane  and  the  origin. 

*  Returns  1  if  hyperplane  is  well  defined  and  0  if  points 

*  are  collinear. 

*/ 

int  Perpendicular(MAT  *A,  int  *hyper,  VEC  *DELTA) 
{ 


Ill 


MAT  *X,  *Y; 
VEC  *  DELTA  STAR; 
VEC  *TEMP1,  *TEMP2; 
int  i,  j),  ret_val; 

p  =  A-m; 

X  =  get_mat(p—  1  ,p); 
Y  =  get_mat(p— l,p— 1); 
DELTASTAR  =  get_vec(p-l); 
TEMPI  =  get_vec(p-l); 
TEMP2  =  get.vec(p); 

for  (i  =  1;  i  <  p  -  1;  ++i)  { 

get_row(A,hyper[i-l]-l,TEMP2); 

set_row(X,i-l,TEMP2); 

} 

sub_matrix(X,0,l,p— 2,p— 1,Y); 

ret_val  =  0; 

if  (fabs(det(Y))  >  MACHEPS)  { 
minverse(Y,Y); 
get_col(X,0,TEMPl); 
mv_mlt(Y,TEMPl,  DELTASTAR); 
for  (i  =  1;  i  <  p  -  1;  ++i)  { 

v_set(DELTA,i,v-entry(DELTASTAR,i-l)); 

} 
v_set(DELTA,0,-l); 

ret_val  =  1; 
} 

freemat(X); 

freemat(Y); 

freevec(DELTASTAR); 

freevec(TEMPl); 

freevec(TEMP2); 

return  ret_val; 


} 


/* 

*  Indicates  whether  (1)  or  not  (0)  the  pair  of  points  from  A 

*  whose  row  labels  are  in  pair[]  is  on  opposite  sides  of  the 

*  hyperplane  by  using  the  signs  of  the  inner  products  of 
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*  each  point  with  the  vector  (DELTA)  thru  the  origin  and 

*  perpendicular  to  the  current  hyperplane. 

*  Note:  actually  (1/2)  is  returned  if  one  of  the  pair  is 

*  on  the  hyperplane. 

V 

double  OppositeSide(MAT  *A,  int  *pair,  VEC  *DELTA) 

{ 

VEC  *TEMP; 

double  tempi,  temp2,  d; 

TEMP  =  get_vec(A->n); 

get  jow(A,pair[0]-l, TEMP); 
tempi  =  in_prod(TEMP,DELTA); 
get_row(A,pair[l]-l,TEMP); 

temP2  =  in_prod(TEMP,DELTA); 

d  =  — (double)(sign(templ)  *  sign(temp2)  —  l)/2; 

freevec(TEMP); 

return  d; 


C.1.2     quadrant.c 

^include  <stdio.h> 
^include  <math.h> 
^include  "general. h" 

I*  Given  pairs  of  vectors  (rows  of  Al  and  rows  of  A2)  and  estimates 

*  of  the  nuisance  location  parameters, 

*  Quadrant  1  returns  the  value  of  the  interdirection  quadrant 

*  statistic. 

*/ 

double  Quadrantl(MAT  *A1,  MAT  *A2,  VEC  *THETA1,  VEC  +THETA2) 

{ 

double  Q; 

int  n,  adj_n,  rl,  r2,  nhypsl,  nhyps2; 

int  i,  j,  zl index,  z2index; 

double  tempi,  temp2,  adjl,  adj2; 

MAT  *T1,  *T2; 

MAT  *C0UNTS1,  *C0UNTS2; 
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} 


n  =  A  1— mii; 

rl  =  AI-mi; 

r2  =  A2-mi; 

Tl  =  get_inat(n,rl); 

T2  =  get  _mat(n,r'2); 

C0UNTS1  =  get_mat(n,n); 

COUNTS2  =  get_mat(n,n); 

cp_mat(Al,Tl); 

cp_mat(A2,T2); 

A 

*  This  is  a  patch  to  adjust  the  statistic  if  one  of  the 

*  points  happens  to  be  equal  to  the  location  estimate 

*/ 

zlindex  =  Center(Tl,THETAl) 
z2index  =  Center(T2,THETA2) 
adj_n  =  n  —  (zlindex  >  0)  —  (z2index  >  0) 
+  (zlindex  >  0  kk  z2index  >  0); 

nhypsl  =  InterdirectionCounts(Tl,COUNTSl); 
nhyps2  =  InterdirectionCounts(T2,COUNTS2); 
adjl  =  (double)((n  -  rl  +  1)  *  (n  -  rl))/(n  *  (n  -  1)); 
adj2  =  (double)((n  -  r2  +  1)  *  (n  -  r2))/(n  *  (n  -  1)); 

Q  =  0.0; 

for  (i  =  1;  i  <  n  -  1;  ++i)  { 
for  (j  =  i+  1;  j  <  n;  ++j)  { 

tempi  =  m_entry(COUNTSl,i-l,j-l)/(adjl  *  nhypsl); 

temp2  =  m_entry(COUNTS2,i-l,j-l)/(adj2  *  nhyps2); 

Q  +=  cos(PI  *  tempi)  *  cos(PI  *  temp2); 

} 

} 
freemat(COUNTSl); 

freemat(COUNTS2); 

freemat(Tl); 

freemat(T2); 

return  (2  *  ((double)(rl  *  r2)/adj_n)  *  Q  +  (rl  *  r2)); 


/*    Quadrant2  is  a  computationally  less  intensive  approximation 

*  for  Quadrant  1,  which  must  be  given  additional  input  for 

*  the  number  of  iterations  and  tuning  parameter  to  be  used 
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*    in  the  estimation  of  covariance  matrices. 

V 

double  Quadrant2(MAT  *A1,  MAT  *A2,  VEC  *THETA1,  VEC  *THETA2, 
int,  iter,  double  tune) 

{ 

MAT  *SIGMA1,  *SIGMA2,  *SJNV1,  *S_INV2; 

VEC  *Y1,  *Y2,  *X1,  *X2; 

VEC  *TEMP1,  *TEMP2,  *TEMP3,  *TEMP4; 

int  n,  rl,  r2,  i,  j; 

double  d,  dl,  d2,  si,  s2,  tl,  t2,  nl,  n2; 

n  =  Al— >m; 
rl  =  Al-m; 
r2  =  A2^n; 

SIGMA  1  =  getjnat(rl,rl); 
SIGMA2  =  get_mat(r2,r2); 
SJNV1  =  get_mat(rl,rl); 
S  JNV2  =  get_mat(r2,r2); 

Yl  =  get_vec(rl); 
Y2  =  get_vec(rl); 
XI  =  get_vec(r2); 
X2  =  get_vec(r2); 

TEMPI  =  get_vec(rl) 
TEMP2  =  get_vec(rl) 
TEMP3  =  get_vec(r2) 
TEMP4  =  get_vec(r2) 

A 

*  Iterative  estimation  of  covariance  (M-estimates) 

*/ 

InitialEstimates(SIGMAl,THETAl,Al); 
Initial  Estimates(SIGMA2,THETA2,A2); 
for  (i  =  0;  i  <  iter;  ++i)  { 

NextEstimates(SIGMAl,THETAl,Al,tune); 

NextEstimates(SIGMA2,THETA2,A2,tune); 

} 

A 

*  Calculation  of  statistic 
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*/ 


m_inverse(SIGMAl,SJNVl); 
m_iuverse(SIGMA2,SJNV2); 

d  =  0.0; 

for(i  =  0;  i  <  n  -  1;  ++i)  { 

get_row(Al,i,Yl); 
v^sub(Yl,THETAl,Yl); 
vm_mlt(SJNVl,Yl,TEMPl); 
si  =  sqrt(in_prod(TEMPl,Yl)); 
get_row(A2,i,Xl); 
v^ub(Xl,THETA2,Xl); 
vnunlt(SJNV2,Xl,TEMP3); 
tl  =  sqrt(in_prod(TEMP3,Xl)); 
for(j  =  i  +  1;  j  <  n;  ++j)  { 

get_row(Al,j,Y2); 

v^ub(Y2,THETAl,Y2); 

vmjnlt(SJNVl,Y2,TEMP2); 

nl  =  in_prod(TEMPl,Y2); 

s2  =  sqrt(in_prod(TEMP2,Y2)); 

dl  -  (si  >  0  kk  s2  >  0  ?  nl/(sl  *  s2)  :  0); 

get_row(A2,j,X2); 

vj5ub(X2,THETA2,X2); 

vm_mlt(SJNV2,X2,TEMP4); 

n2  =  in_prod(TEMP3,X2); 

t2  =  sqrt(in_prod(TEMP4,X2)); 

d2  =  (tl  >  0  kk  t2  >  0  ?  n2/(tl  *  t2)  :  0); 

d  +=  dl  *  d2; 

} 

} 
freemat(SIGMAl); 

freemat(SIGMA2); 

freemat(SJNVl); 

freemat(SJNV2); 

freevec(Xl); 

freevec(X2); 

freevec(Yl); 

freevec(Y2); 

freevec(TEMPl); 

freevec(TEMP2); 

freevec(TEMP3); 

freevec(TEMP4); 
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return  (2  *  (double) (rl  *  r'2)/n  *  d  +  rl  *  ri 
} 


C.1.3     loop.c 

^include  <stdio.h> 
^include  <math.h> 
^include  "ranlib.h" 
^include  "general. h" 

#define  NOMINAL  0.05 

A 

*  This  is  the  main  routine  which  does  the  repetitions 

*  and  counts  the  number  of  times  each  of  the  competing 

*  statistics  falls  in  the  rejection  region. 

*/ 

char  *usage  =  "Usage:  %s  [p  or  n]  dim  N  reps  delta  nu  seedfile\n"; 
char  *cmd; 

main(int  argc,  char  **argv) 
{ 

double  delta,  nu; 

int  i,  j,  dim,  rl,  r2,  N,  reps; 

double  r,  t,  u,  s,  m,  n,  k,  v; 

double  Ql,  Q2,  PS,  L,  LF,  LX,  V,  VF,  VX,  U,  UF,  UX,  R,  RF; 

double  LF_NumDf,  LF_DenDf,  LX JDf,  VF_NumDf,  VF_DenDf,  VX_Df; 

double  UF_NumDf,  UFJDenDf,  RF_NumDf,  RF_DenDf; 

double  X,  Fl,  F2,  F3,  F4; 

VEC  *ER,  *EI; 

MAT  *A1,  *A2,  *B1,  *B2; 

MAT  *TEMP1,  *TEMP2; 

VEC  +THETA1,  +THETA2; 

int  cLX,  cLF,  cVX,  c.VF,  cUF,  clIX,  cRF,  cPS,  cQl,  cQ2; 

int  iLX,  iLF,  iVX,  iVF,  iUF,  iUX,  iRF,  iPS,  iQl,  iQ2; 

char  option; 

long  seedl,  seed2,  g; 
char  seedfile[80]; 
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FILE  *seeds; 

cmd  =  *argv; 

if  (argc  <  8)  bye(); 

if  (sscanf(argv[l],"%s",&;option)  ^  1)  bye(); 

if  (sscanf(argv[2],"%u",&dim)  ^  1)  bye(); 
if  (dim  <  1  ||  dim  >  MAX_DIM)  { 

printf(" Invalid  value  of  r\n"); 

exit(O); 
} 

if  (sscanf(argv[3],"%u",&N)  ^  1)  bye(); 
if  (N  <  2  *  r  +  1  ||  N  >  MAX_N)  { 

printf(" Invalid  value  of  N\n"); 

exit(O); 
} 

if  (sscanf(argv[4],"%u",&reps)  ^  1)  bye(); 
if  (reps  <  1)  { 

printf(" Invalid  value  of  reps\n"); 

exit(O); 
} 

if  (sscanf(argv[5],"%lf",&delta)  ^  1)  bye(); 

if  (delta  <  0.0  ||  delta  >  0.5)  { 
printf(" Invalid  value  of  delta\n"); 
exit(0); 

} 

if  (sscanf(argv[6],"%lf",&nu)  ^  1)  bye(); 
if  (nu  <  0.0)  { 

printf(" Invalid  value  of  nu\n"); 

exit(0); 

} 

if  (sscanf(argv[7],"%s",&seedfile)  ^  1)  bye(); 

/* 
*    Set  generator  number  and  current  seeds 

V 

seeds  =  fopen(seedfile,"rt"); 
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while(fscanf( seeds, "%ld  %ld  %ld",&g,&seedl,&seed2)  /  EOF); 

fclose(seeds); 

gscgn(l,&g); 

setall(seedl,seed2); 

rl  =  r2  =  dim; 
Al  =  get_mat(N,rl); 
A2  =  get_mat(N,r2); 
Bl  =  get_mat(N,rl); 
B2  =  get_mat(N,r2); 
TEMPI  =  get_mat(N,rl); 
TEMP2  =  get_mat(N,r2); 
THETA1  =get_vec(rl); 
THETA2  =  get_vec(r2); 
ER  =  get_vec(rl); 
EI  =  get_vec(rl); 

/*  Constants  needed  for  statistics  */ 

t  =  ((rl*rl  +  r2*r2  -  5.0  >  0)  ? 

pow((rl*rl*r2*r2  -  4.0)/(rl*rl  +  r2*r2  -  5.01,0.5)  :  1.0); 
k  =  N  -  (rl  +  r2  +  3)/2.0; 
u  =  (rl*r2  -  2)/4.0; 
s  =  min(rl,r2); 

m  =  (abs(rl  -  r2)  -  1.0)/2.0; 
n  =  (N  -  rl  -  r2  -  2.0)/2.0; 
r  =  max(rl,r2); 
v  =  N  -  r2  -  1; 

LF_NumDf  =  rl*r2; 

LF_DenDf  =  k*t  -  2.0*u; 

VF_NumDf  =  UF_NumDf  =  s*(2*m  +  s  +  1.0); 

VFJDenDf  =  s*(2*n  +  s  +  1.0); 

UF-DenDf  =  2*(s*n  +  1.0); 

RF_NumDf  =  r; 

RF_DenDf  =  v  -  r  +  r2; 

X  =  critchi(NOMINAL,rl*r2); 
Fl  =  critf(NOMINAL,LF_NumDf,LFJ)enDf); 
F2  =  critf(NOMINAL,VF_NumDf,VF_DenDf) 
F3  =  critf(NOMINAL,UF_NumDf,UF_DenDf) 
F4  =  critf(NOMINAL,RF_NumDf,RFJDenDf) 

cLX  =  cLF  =  cVX  =  cVF  =  cUF  =  cUX  =  cRF  =  cQl  =  cQ2  =  cPS  =  0; 
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if  (option  ==   '  p ' )  { 

printf("\n\n  LF        VF        UF        RF         LX        VX"); 

printf("         UX        PS        Ql         Q2\n"); 
} 

for(j  =  reps;  j  >  0; j)  { 

/* 

*  Generate  the  initial  random  vectors 

*/ 

rand_dat(Al,nu); 
rand_dat(A2,nu); 

A 

*  Create  dependence  via  model 

*/ 

sm_mlt(delta,Al,TEMPl); 
ms_mltadd(TEMPl,A2,(1.0  -  delta), Bl); 
sm_mlt(delta,A2,TEMP2); 
ms_mltadd(TEMP2,A  1,(1.0  -  delta),B2); 

/* 

*  Compute  statistics 

*/ 

/*  Eigenvalues  needed  to  compute  normal  theory  tests  */ 

S_Eigenvalues(Bl,B2,ER,EI); 

/*  Wilks'  lambda  */ 

L  =  1.0; 

for  (i  =  1;  i  <  rl;  ++i) 
L  *=  (1.0  -  v_entry(ER,i-l)); 

LF  =  (pow(L,-1.0/t)  -  1.0)*((double)LF_DenDf/LF_NumDf); 
LX  =  -k*log(L); 

/*  Pillai's  trace  */ 

V  =  0.0; 

for  (i  =  1;  i  <  rl;  ++i) 
V  +=  v_entry(ER,i-l); 
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VF  =  V/(s  -  V)*((double)VF_DenDf/VF_NumDf); 

VX  =  N*V; 

/*  Hotelling-Lawley  trace  */ 

U  =  0.0; 

for  (i  =  1;  i  <  rl;  ++i) 
U  +=  v_entry(ER,i-l)/(l  -  v_entry(ER,i-l)); 

UF  =  U/s*((double)UF_DenDf/UF_NumDf); 

UX  =  N*U; 

/*  Roy's  greatest  root  */ 

R  =  v_entry(ER,0); 
for  (i  =  2;  i  <  rl;  ++i) 
R  =  max(R,v_entry(ER,i-l)); 

RF  =  R/(1.0  -  R)*((double)RF_BenDf/RF_NumDf); 

/*  Puri-Sen  sign  statistic  */ 

PS  =  PuriSen(Bl,B2); 

/*  Quadrant  statistics  */ 

OjaMedian(Bl,THETAl); 
OjaMedian(B2,THETA2); 

Ql  =  Quadrantl(Bl,B2,THETAl,THETA2); 

Q2  =  Quadrant2(Bl,B2,THETAl,THETA2,5,2.0); 

/*  Reject?  */ 

iLF  =  (LF  >  Fl); 
iVF  =  (VF  >  F2); 
iUF  =  (UF  >  F3); 
iRF  =  (RF  >  F4); 

iLX  =  (LX  >  X); 
iVX  =  (VX  >  X); 
iUX  =  (UX  >  X); 
iPS  =  (PS  >  X); 
iQl  =  (Q1  >X); 
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iQ2  =  (Q2>X); 

A 

*  Update  counters 

*/ 

cLF  +  =  iLF; 
cVF  +=  iVF; 
cUF  +=  iUF; 
cRF  +=  iRF; 
cLX  +=  iLX; 
cVX  +=  iVX; 
cUX  +=  iUX; 
cPS  +=  iPS; 
cQl  +=iQl; 
cQ2  +=  iQ2; 

/* 

*  Print  value  of  statistics 

*/ 

if  (option  ==  'p  ' )  { 
printf("\n"); 
printf("%4d.  ",reps  -  j  +  1); 

printf("%6.2f",LF); 

if  (iLF)  printf("*");  else  printf("  "); 

printf("%6.2f",VF); 

if  (iVF)  printfC'*");  else  printf("  "); 

printf("%6.2f",UF); 

if  (iUF)  printf("*");  else  printf("  "); 

printf("%6.2f",RF); 

if  (iRF)  printf("*  ");  else  printf("    "); 

printf("%6.2f",LX); 

if  (iLX)  printf(M*");  else  printf("  "); 

printf("%6.2f",VX); 

if  (iVX)  printf ("*");  else  printf("  "); 

printf("%6.2f",UX); 

if  (iUX)  printf("*");  else  printf("  "); 

printf("%6.2f",PS); 

if  (iPS)  printf("*");  else  printf("  "); 

printf("%6.2f'\Ql); 

if  (iQl)  printf("*");  else  printf("  "); 

printf("%6.2f,l,Q2); 
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if  (iQ2)  printf  ("*"); 


} 
} 


A 

*    Print  empirical  powers 

V 

if  (option  ==   '  p' ) 
printf("\n\nEmpirical  Powers\n\n"); 


pr 

pr 
pr 
pr 
pr 

pr 
pr 
pr 
pr 
pr 
pr 


ntf("%4.2f", delta); 

ntf("%6.4f  ",(double)cLF/reps) 
ntf("%6.4f  ",(double)cVF/reps) 
ntf("%6.4f",(double)cUF/reps) 
ntf( "%6.4f  ' ' ,(double)cRF/reps) 


ntf("%6.4f  ^(doubleJcLX/reps); 
ntf("%6.4f  ",(double)cVX/reps); 
ntf("%6.4f  ",(double)cUX/reps); 
ntf("%6.4f",(double)cPS/reps); 
ntf("%6.4f  ",(double)cQl/reps); 
ntf("%6.4f\n",(double)cQ2/reps); 


A 
*    Save  current  generator  and  seeds 

V 

gscgn(0,&g); 
getsd(&seedl  ,&seed2); 
seeds  =  fopen(seedfile,"at"); 
fprintf(seeds,"%ld  %ld  %ld\n",g,seedl,seed2); 
fclose(seeds); 

return  0; 
} 

A  exit  with  message  */ 
int  bye() 

{ 

printf  (usage, cmd); 

exit(0); 
} 
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('.2      Results  in  Tabular  Form 

Following  are  tables  of  the  simulation  results.  There  are  two  separate  tables  on 
each  page  dividing  the  statistics  into  those  which  were  compared  to  an  F  critical  value 
and  those  which  were  compared  to  a  chi-square  critical  value. 


Table  C.l.  r  =  1. 


n 


30,  v  =  0.1,  reps  =  2500 


A 

LF 

VF 

UF 

RF 

0.00 

0.0688 

0.0688 

0.0688 

0.0688 

0.05 

0.1544 

0.1544 

0.1544 

0.1544 

0.10 

0.3836 

0.3836 

0.3836 

0.3836 

0.20 

0.9380 

0.9380 

0.9380 

0.9380 

0.30 

0.9928 

0.9928 

0.9928 

0.9928 

Table  C.2.  r  =  1,  n  =  30,  v  =  0.1,  reps  =  2500 


A 

LX 

vx 

ux 

PS 

Qi 

Q2 

0.00 

0.0688 

0.0692 

0.0772 

0.0268 

0.0412 

0.0412 

0.05 

0.1544 

0.1560 

0.1756 

0.3188 

0.4132 

0.4132 

0.10 

0.3832 

0.3892 

0.4212 

0.6240 

0.7284 

0.7284 

0.20 

0.9380 

0.9408 

0.9488 

0.9312 

0.9648 

0.9648 

0.30 

0.9928 

0.9928 

0.9932 

0.9940 

0.9976 

0.9976 
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Table  C.3.  r  =  1,  n  =  30,  v  =  0.5,  reps  =  2500 


A 

LF 

VF 

UF 

RF 

0.00 

0.0532 

0.0532 

0.0532 

0.0532 

0.05 

0.0860 

0.0860 

0.0860 

0.0860 

0.10 

0.2228 

0.2228 

0.2228 

0.2228 

0.20 

0.7944 

0.7944 

0.7944 

0.7944 

0.30 

0.9988 

0.9988 

0.9988 

0.9988 

Table  C.4.  r  -  1, 

n  =  30,  v  —  0.5,  reps  =  251 

)0 

A 

LX 

VX 

UX 

PS 

Qi 

Q2 

0.00 

0.0532 

0.0548 

0.0708 

0.0228 

0.0316 

0.0316 

0.05 

0.0860 

0.0884 

0.1096 

0.0464 

0.0612 

0.0612 

0.10 

0.2224 

0.2284 

0.2748 

0.1272 

0.1684 

0.1684 

0.20 

0.7944 

0.8008 

0.8388 

0.4672 

0.5604 

0.5604 

0.30 

0.9988 

0.9988 

0.9988 

0.8840 

0.9276 

0.9276 

Table  C.5.  r  =  1,  n  =  30,  v  =  1,  reps  =  2500 


A 

LF 

VF 

UF 

RF 

0.00 

0.0500 

0.0500 

0.0500 

0.0500 

0.05 

0.0928 

0.0928 

0.0928 

0.0928 

0.10 

0.2280 

0.2280 

0.2280 

0.2280 

0.20 

0.7728 

0.7728 

0.7728 

0.7728 

0.30 

0.9996 

0.9996 

0.9996 

0.9996 
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Table  C 

.6.  r  =  1 

7).  =  30, 

v  —  1,  re 

ps  =  2500 

A 

LX 

VX 

ux 

PS 

Qi 

Q2 

0.00 

0.0500 

0.0528 

0.0664 

0.0320 

0.0460 

0.0460 

0.05 

0.0928 

0.0964 

0.1168 

0.0356 

0.0564 

0.0564 

0.10 

0.2280 

0.2328 

0.2708 

0.0792 

0.1108 

0.1108 

0.20 

0.7728 

0.7784 

0.8164 

0.3052 

0.3744 

0.3744 

0.30 

0.9996 

0.9996 

1.0000 

0.7660 

0.8308 

0.8308 

Table  C.7.  r  =  1,  n  =  30,  v  =  10,  reps  =  2500 


A 

LF 

VF 

UF 

RF 

0.00 

0.0584 

0.0584 

0.0584 

0.0584 

0.05 

0.0772 

0.0772 

0.0772 

0.0772 

0.10 

0.2160 

0.2160 

0.2160 

0.2160 

0.20 

0.7688 

0.7688 

0.7688 

0.7688 

0.30 

0.9972 

0.9972 

0.9972 

0.9972 

Table  C.8.  r ----  1, 

n  =  30, 

j  =  10,  reps  =  2500 

A 

LX 

VX 

UX 

PS 

Qi 

Q2 

0.00 

0.0584 

0.0612 

0.0764 

0.0284 

0.0408 

0.0408 

0.05 

0.0772 

0.0804 

0.0980 

0.0264 

0.0428 

0.0428 

0.10 

0.2160 

0.2220 

0.2636 

0.0584 

0.0776 

0.0776 

0.20 

0.7688 

0.7752 

0.8076 

0.2072 

0.2648 

0.2648 

0.30 

0.9972 

0.9972 

0.9984 

0.5920 

0.6660 

0.6660 
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Table  C.9.  r  =  1,  n  =  30,  rf/  =  1,  reps  =  2500 


A 

LF 

VF 

UF 

RF 

0.00 

0.0652 

0.0652 

0.0652 

0.0652 

0.05 

0.2508 

0.2508 

0.2508 

0.2508 

0.10 

0.4788 

0.4788 

0.4788 

0.4788 

0.20 

0.9448 

0.9448 

0.9448 

0.9448 

0.30 

0.9956 

0.9956 

0.9956 

0.9956 

Table  C. 

10.  r  =  1 

,  n  =  30, 

df  =  1,  reps  =  25 

30 

A 

LX 

VX 

ux 

PS 

Qi 

Q2 

0.00 

0.0652 

0.0672 

0.0772 

0.0252 

0.0392 

0.0392 

0.05 

0.2508 

0.2536 

0.2768 

0.0944 

0.1288 

0.1288 

0.10 

0.4784 

0.4840 

0.5200 

0.2516 

0.3128 

0.3128 

0.20 

0.9448 

0.9460 

0.9556 

0.6944 

0.7700 

0.7700 

0.30 

0.9956 

0.9960 

0.9964 

0.9632 

0.9788 

0.9788 

Table  C.ll.  r  =  1,  n  =  30,  df  =  5,  reps  =  2500 


A 

LF 

VF 

UF 

RF 

0.00 

0.0524 

0.0524 

0.0524 

0.0524 

0.05 

0.0800 

0.0800 

0.0800 

0.0800 

0.10 

0.2208 

0.2208 

0.2208 

0.2208 

0.20 

0.7804 

0.7804 

0.7804 

0.7804 

0.30 

0.9984 

0.9984 

0.9984 

0.9984 
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Table  C. 

12.  r  =  1 

,  n  =  30, 

df  =  5,  reps  =  25 

30 

A 

LX 

vx 

UX 

PS 

Qi 

Q2 

0.00 

0.0524 

0.0540 

0.0640 

0.0244 

0.0384 

0.0384 

0.05 

0.0800 

0.0828 

0.1068 

0.0296 

0.0428 

0.0428 

0.10 

0.2208 

0.2268 

0.2612 

0.0852 

0.1116 

0.1116 

0.20 

0.7804 

0.7848 

0.8212 

0.3588 

0.4368 

0.4368 

0.30 

0.9984 

0.9984 

0.9992 

0.8224 

0.8860 

0.8860 

ble  C.l 

3.  r  =  2, 

n  =  30, 

v  =  0.1,  i 

-eps  =  25 

A 

LF 

VF 

UF 

RF 

0.00 

0.1152 

0.1072 

0.1196 

0.1960 

0.05 

0.1528 

0.1420 

0.1588 

0.2740 

0.10 

0.3672 

0.3620 

0.3696 

0.5148 

0.20 

0.9704 

0.9708 

0.9700 

0.9916 

0.30 

1.0000 

1.0000 

1.0000 

1.0000 

Table  C.l 

4.  r  =  2, 

n  =  30, 

;/  =  0.1,  i 

•eps  =  25 

00 

A 

LX 

VX 

UX 

PS 

Qi 

Q2 

0.00 

0.1156 

0.1088 

0.1620 

0.0852 

0.0628 

0.0576 

0.05 

0.1528 

0.1444 

0.2240 

0.2980 

0.3364 

0.3208 

0.10 

0.3672 

0.3648 

0.4912 

0.6304 

0.7564 

0.7356 

0.20 

0.9704 

0.9708 

0.9916 

0.9696 

0.9968 

0.9948 

0.30 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 

1.0000 
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Table  C.15.  r  =  2,  n  =  30,  u  =  0.5,  reps  =  2500 


A 

LF 

VF 

UF 

RF 

0.00 

0.0516 

0.0452 

0.0560 

0.1652 

0.05 

0.0860 

0.0788 

0.0936 

0.2188 

0.10 

0.2084 

0.2016 

0.2124 

0.3952 

0.20 

0.8784 

0.8788 

0.8728 

0.9404 

0.30 

1.0000 

1.0000 

1.0000 

1.0000 

Table  C. 

6.  r  =  2, 

n  =  30, 

v  =  0.5,  i 

■eps  =  25 

00 

A 

LX 

vx 

ux 

PS 

Ql 

Q2 

0.00 

0.0516 

0.0456 

0.1136 

0.0828 

0.0604 

0.0504 

0.05 

0.0860 

0.0788 

0.1720 

0.1004 

0.0844 

0.0760 

0.10 

0.2088 

0.2024 

0.3296 

0.1992 

0.2096 

0.1984 

0.20 

0.8784 

0.8792 

0.9408 

0.6480 

0.7960 

0.7868 

0.30 

1.0000 

1.0000 

1.0000 

0.9828 

0.9984 

0.9988 

Table  C.17.  r  =  2,  n  =  30,  v  =  1,  reps  =  2500 


A 

LF 

VF 

UF 

RF 

0.00 

0.0488 

0.0452 

0.0516 

0.1548 

0.05 

0.0772 

0.0696 

0.0824 

0.2220 

0.10 

0.2268 

0.2088 

0.2316 

0.4108 

0.20 

0.8648 

0.8632 

0.8644 

0.9232 

0.30 

0.9996 

0.9996 

0.9996 

1.0000 
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Table  C.18.  r  =  2,  n  =  30,  v  =  1,  reps  =  2500 


A 

LX 

VX 

ux 

PS 

Qi 

Q2 

0.00 

0.0488 

0.0456 

0.1124 

0.0844 

0.0556 

0.0456 

0.05 

0.0780 

0.0704 

0.1692 

0.0872 

0.0712 

0.0644 

0.10 

0.2268 

0.2120 

0.3560 

0.1540 

0.1748 

0.1536 

0.20 

0.8648 

0.8636 

0.9256 

0.5100 

0.6320 

0.6208 

0.30 

0.9996 

0.9996 

1.0000 

0.9300 

0.9876 

0.9892 

Table  C.19.  r  =  2,  n  =  30,  ;/  =  10,  reps  =  2500 


A 

LF 

VF 

UF 

RF 

0.00 

0.0480 

0.0416 

0.0524 

0.1552 

0.05 

0.0740 

0.0680 

0.0788 

0.2116 

0.10 

0.2124 

0.2048 

0.2200 

0.4040 

0.20 

0.8700 

0.8680 

0.8672 

0.9312 

0.30 

1.0000 

1.0000 

1.0000 

1.0000 

Table  C.20.  r  ==  2 

n  =  30, 

v  =  10,  reps  =  2500 

A 

LX 

VX 

UX 

PS 

Qi 

Q2 

0.00 

0.0480 

0.0436 

0.1144 

0.0876 

0.0540 

0.0504 

0.05 

0.0740 

0.0688 

0.1608 

0.0968 

0.0756 

0.0616 

0.10 

0.2124 

0.2052 

0.3476 

0.1480 

0.1376 

0.1260 

0.20 

0.8704 

0.8712 

0.9336 

0.4048 

0.5004 

0.4900 

0.30 

1.0000 

1.0000 

1.0000 

0.8664 

0.9640 

0.9672 
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Table  C.21.  r  =  2,  n  =  30,  df  =  I,  reps  =  2500 


A 

LF 

VF 

UF 

RF 

0.00 

0.1136 

0.1076 

0.1168 

0.1700 

0.05 

0.3628 

0.3556 

0.3664 

0.4728 

0.10 

0.6192 

0.6172 

0.6208 

0.7132 

0.20 

0.9900 

0.9900 

0.9904 

0.9968 

0.30 

1.0000 

1.0000 

1.0000 

1.0000 

Table  C.22.  r  ==  2 

,  n  ■—  30, 

df  =  1,  reps  =  251 

JO 

A 

LX 

vx 

UX 

PS 

Qi 

Q2 

0.00 

0.1140 

0.1076 

0.1492 

0.0748 

0.0500 

0.0416 

0.05 

0.3628 

0.3564 

0.4360 

0.1892 

0.2108 

0.1860 

0.10 

0.6192 

0.6188 

0.7000 

0.4340 

0.5280 

0.4944 

0.20 

0.9900 

0.9900 

0.9964 

0.9160 

0.9740 

0.9680 

0.30 

1.0000 

1.0000 

1.0000 

0.9984 

1.0000 

0.9996 

Table  C.23.  r  =  2,  n  =  30,  df  =  5,  reps  =  2500 


A 

LF 

VF 

UF 

RF 

0.00 

0.0460 

0.0436 

0.0508 

0.1564 

0.05 

0.0852 

0.0796 

0.0900 

0.2208 

0.10 

0.2220 

0.2100 

0.2252 

0.4152 

0.20 

0.8896 

0.8900 

0.8884 

0.9432 

0.30 

1.0000 

1.0000 

1.0000 

1.0000 
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Table  C.24.  r  ==  2 

,  n  =  30, 

df  =  5,  reps  =  25' 

)0 

A 

LX 

vx 

UX 

PS 

Qi 

Q2 

0.00 

0.0460 

0.0440 

0.1064 

0.0804 

0.0492 

0.0448 

0.05 

0.0852 

0.0800 

0.1668 

0.1012 

0.0944 

0.0840 

0.10 

0.2220 

0.2104 

0.3592 

0.1856 

0.1796 

0.1664 

0.20 

0.8896 

0.8920 

0.9512 

0.6028 

0.7552 

0.7484 

0.30 

1.0000 

1.0000 

1.0000 

0.9696 

0.9980 

0.9984 

Table  C.25.  r  =  3,  n  =  30,  v  =  0.1,  reps  =  1 000 


A 

LF 

VF 

UF 

RF 

0.00 

0.135 

0.116 

0.149 

0.373 

0.05 

0.185 

0.162 

0.201 

0.466 

0.10 

0.375 

0.357 

0.385 

0.692 

0.00 

0.135 

0.116 

0.149 

0.373 

0.05 

0.185 

0.162 

0.201 

0.466 

Table  C.26.  r  =  3,  n  =  30,  v  =  0.1,  reps  =  1000 


A 

LX 

VX 

UX 

Qi 

Q2 

0.00 

0.136 

0.116 

0.257 

0.074 

0.058 

0.05 

0.189 

0.160 

0.355 

0.279 

0.223 

0.10 

0.375 

0.354 

0.601 

0.754 

0.697 

0.00 

0.136 

0.116 

0.257 

0.074 

0.058 

0.05 

0.189 

0.160 

0.355 

0.279 

0.223 
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Table  C.27.  r  =  3,  n  =  30,  v  =  Q.5,  reps  =  1000 


A 

LF 

VF 

UF 

RF 

0.00 

0.057 

0.039 

0.067 

0.356 

0.05 

0.088 

0.073 

0.105 

0.404 

0.10 

0.228 

0.205 

0.245 

0.620 

0.00 

0.057 

0.039 

0.067 

0.356 

0.05 

0.088 

0.073 

0.105 

0.404 

Table  C.28.  r  -- 

:  3,  n  = 

30,  v  = 

0.5,  reps  =  1000 

A 

LX 

VX 

ux 

Qi 

Q2 

0.00 

0.057 

0.039 

0.195 

0.059 

0.037 

0.05 

0.090 

0.073 

0.264 

0.106 

0.088 

0.10 

0.230 

0.205 

0.495 

0.262 

0.216 

0.00 

0.057 

0.039 

0.195 

0.059 

0.037 

0.05 

0.090 

0.073 

0.264 

0.106 

0.088 

Tab 


e  C.29 

.  r  =  3, 

n  =  30, 

i/  =  l, 

reps  = 

A 

LF 

VF 

UF 

RF 

0.00 

0.052 

0.038 

0.059 

0.358 

0.05 

0.079 

0.063 

0.096 

0.422 

0.10 

0.212 

0.200 

0.234 

0.601 

0.00 

0.052 

0.038 

0.059 

0.358 

0.05 

0.079 

0.063 

0.096 

0.422 

=  1000 
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Table  ( 

3.30.     7-    : 

=  3,  n  = 

30,  v  = 

:  1,  reps 

=  1000 

A 

LX 

VX 

ux 

Qi 

Q2 

0.00 

0.053 

0.038 

0.200 

0.075 

0.060 

0.05 

0.080 

0.063 

0.271 

0.084 

0.063 

0.10 

0.217 

0.198 

0.490 

0.190 

0.145 

0.00 

0.053 

0.038 

0.200 

0.075 

0.060 

0.05 

0.080 

0.063 

0.271 

0.084 

0.063 
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